A new visualization Bruce Herr and I recently completed is being featured in this week’s New Scientist Magazine (the article is free online, minus the viz). They did a good job jazzing up the language used to describe the viz (‘power struggle’, ‘bubbling mass’, ‘blitzed articles’), but they also dumbed down the technical accomplishments. I guess not everyone gets as excited about algorithms as I do.

Before I talk any more about the viz, though, let me mention that it is appearing at the NetSci 2007 Conference this week, and hopefully a variant will appear at Wikimania later this summer as well. The viz is a huge 5 feet by 5 feet when printed, so I only include a smaller, low-res version here. At some point, high-quality art prints of it will be for sale at SciMaps to fund further visualization research.

Now for the good stuff. Much like my visualization of the Netflix Prize competition data, we began this piece by representing the data as a network. In this case the nodes in the network are Wikipedia articles and the edges are the links between articles. We then (with some help from our friends at Sandia) used an algorithm to lay out all 650,000 nodes (Wikipedia articles) that had at least one link, in such a way that similar articles are near one another. These are the yellow dots, which when viewed at low res give a yellow tint to the whole picture.
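For readers who want to picture the first step, here is a minimal sketch of turning article links into a network and keeping only the articles that have at least one link. The function names and the toy data are made up for illustration; this is not our actual pipeline, and it deliberately leaves out the layout step.

```python
from collections import defaultdict


def build_link_network(links):
    """Build an undirected adjacency map from (source, target) article links."""
    neighbors = defaultdict(set)
    for source, target in links:
        if source == target:
            continue  # a self-link says nothing about similarity
        neighbors[source].add(target)
        neighbors[target].add(source)
    return dict(neighbors)


def linked_articles(all_articles, neighbors):
    """Keep only the articles that take part in at least one link."""
    return [a for a in all_articles if neighbors.get(a)]


# Toy data: the titles and links are invented for this example.
links = [("Article A", "Article B"), ("Article B", "Article C")]
articles = ["Article A", "Article B", "Article C", "Isolated article"]

network = build_link_network(links)
kept = linked_articles(articles, network)
```

Here `kept` holds the three connected articles and drops the isolated one, which is the same filter that got us down to the nodes we laid out.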

The sizes of the nodes (circles, dots, whatever you want to call them) are based on a model of revision activity. So large circles indicate that an article might be controversial, or the subject of lots of vandalism, or just a topic whose content changes frequently. We labeled only the largest nodes, to keep it readable. There is an interactive version of this in the works, based on the Google Maps platform, which will change the labels and pictures used as the user ‘zooms’ in or out. Stay tuned for that.
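To make the sizing idea concrete, here is a rough sketch of mapping an activity score to a circle radius and picking the largest nodes to label. The square-root scaling, the radius limits, and the names `node_radii` and `labels_to_show` are all assumptions for illustration; the revision-activity model we actually used is not reproduced here.

```python
import math


def node_radii(activity, min_radius=1.0, max_radius=20.0):
    """Map each article's activity score to a circle radius.

    Assumption: the radius is interpolated between min_radius and
    max_radius using the square root of the normalised score, so that
    a few very active articles do not swamp the picture.
    """
    top = max(activity.values())
    radii = {}
    for article, score in activity.items():
        fraction = math.sqrt(score / top) if top > 0 else 0.0
        radii[article] = min_radius + (max_radius - min_radius) * fraction
    return radii


def labels_to_show(radii, count):
    """Return the titles of the `count` largest nodes, biggest first."""
    ranked = sorted(radii, key=lambda article: radii[article], reverse=True)
    return ranked[:count]


# Toy scores, not real revision counts.
activity = {"Article A": 100, "Article B": 25, "Article C": 0}
radii = node_radii(activity)
labeled = labels_to_show(radii, 2)
```

An interactive version could call something like `labels_to_show` with a larger `count` as the user zooms in.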

The image used for each tile was selected automatically, simply by using the first image in the most-linked-to article among all the articles in that tile. We were pleasantly surprised by the quality of the images that appeared.
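The selection rule is simple enough to sketch in a few lines. The data structures below (a list of titles per tile, a dictionary of inbound link counts, and a dictionary of image lists) are hypothetical, as is the choice to return `None` when the chosen article has no image.

```python
def tile_image(tile_articles, inbound_links, images):
    """Return the first image of the most linked-to article in a tile.

    tile_articles: titles of the articles that fall inside the tile
    inbound_links: title -> number of links pointing at that article
    images:        title -> list of image names, in page order
    """
    if not tile_articles:
        return None
    # On a tie, max() keeps the article that appears first in the tile list.
    most_linked = max(tile_articles, key=lambda a: inbound_links.get(a, 0))
    article_images = images.get(most_linked, [])
    return article_images[0] if article_images else None


# Toy data: titles, counts and file names are invented.
tile = ["Article A", "Article B", "Article C"]
inbound = {"Article A": 12, "Article B": 40, "Article C": 7}
pictures = {"Article B": ["b_first.jpg", "b_second.jpg"], "Article A": ["a.jpg"]}

chosen = tile_image(tile, inbound, pictures)
```

Here `chosen` is the first image of "Article B", because that article has the most inbound links in the tile.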

Our hope for this visualization approach, which we continue to improve on, is that it could be updated in real time to give a macro sense of what is happening in Wikipedia.  I personally hope that some variation of it will end up in high schools as a teaching tool and for generating discussions.

Top 20 Most Hotly Revised Articles

  • Jesus
  • Adolf Hitler
  • October 2003
  • Nintendo Revolution
  • Hurricane Katrina
  • India
  • RuneScape
  • Anarchism
  • Britney Spears
  • PlayStation 3
  • Saddam Hussein
  • Japan
  • Albert Einstein
  • 2004 Indian Ocean Earthquake
  • New York City
  • Germany
  • Muhammad
  • Pope Benedict XVI
  • Ronald Reagan
  • Hinduism 


Comments

17 Comments so far

  1. Bertalan Meskó on May 22, 2007 3:23 pm

    Great work! Do you plan to work on a specific topic like medicine?

  2. Lucas on May 23, 2007 5:22 am

    Awesome! How did you find how often they were edited?

  3. Tadpole on May 23, 2007 8:32 am

    That’s awesome. I think it’s fascinating to be able to make such stunning visualizations as this. I also find it kind of cool that in the scheme of things, I played a part since I have contributed to Wikipedia, though my part is small, it’s there and that’s kind of cool.

  4. Craig on May 23, 2007 8:33 am

    Very interesting stuff… I’d be curious to see something like this done for other large sites, like Digg or MySpace…

  5. Michael on May 27, 2007 5:18 pm

    Really cool!

    I didn’t expect October 2003 being on the list of “Hotly Revised Articles”.

  6. sj on May 29, 2007 9:16 pm

    You might want to update your algorithm to weigh recent changes a bit more heavily; and to discount bot edits if you don’t already. Through 2004 there were some bots that weren’t flagged as such.

    October 2003 was the last month for which the [[Current Events]] page kept on rolling over from month to month; it was finally moved to [[October 2003]] and a new naming system started. So all edits to the current events page from 2001 to October 2003 get counted in that total.

    “Muhammid” (in your last sidebar above) is I hope a misspelling!

    Lovely work. I want to know how I can get a large-scale version printed out for the Boston Wikipedia group…

    SJ

  7. Richard Morris on June 4, 2007 4:54 pm

    Lovely image it would be great to see more of it in detail.

    I’m wondering about the licence for this image. As most images on wikipedia are released under a sharealike licence I guess this counts as a derived work so GFDL would seem appropriate.

    Rich (Salix alba on wikipedia)

  8. Stephen Holmes on July 10, 2007 5:18 am

    It’d be great if it could create this on-the-fly for individual topics that you searched Wiki for, eg: showing everything that featured within ‘Technology’.

  9. dkr on July 21, 2007 2:22 pm

    The above-mentioned zoomable version exists now:
    http://scimaps.org/maps/wikipedia/

    I found it amusing how much anime was represented. e.g. Ghost in the Shell(with a picture of Motoko) is diagonal from Albert Einstein, and Cowboy Bebop and the Big O are nearby.

  10. frantik on August 14, 2007 7:32 pm

    I like how cheese is right next to Jesus, the Bible, and a bunch of other religious oriented stuff :D

  11. 1389 on August 15, 2007 1:02 am

    The juxtaposition of the first two items on the list somehow gives me a truly eerie feeling…

  12. Sean D. on August 28, 2007 4:46 pm

    Any update on when/whether the “high quality art print” will be available?

  13. Visualizing Science & Tech Activity in Wikipedia : A Beautiful WWW on October 2, 2007 8:55 pm

    […] Visualizing the ‘Power Struggle’ in Wikipedia […]

  14. Bruce Herr on November 6, 2007 5:12 pm

    There are several versions of the wikipedia visualization available for purchase at http://scimaps.org/ordermaps/

    Also, a google maps version of the places and spaces version of the wikipedia visualization is available here: http://www.scimaps.org/maps/wikipedia-ps3/ It shows the math, science, and technology related articles contained in wikipedia.

    Also also, we are working on an updated version with newer data and a paper describing our work :)

  15. hanteng on December 29, 2007 10:32 am

    What should I need to learn and know if I am trying to duplicate this process on Chinese Wikipedia? It is very interesting and I would love to see the results when this applied to other versions…. Is it possible for me to do this as part of my non-commercial PhD work in Oxford?

  16. Aaron Osborn on March 16, 2008 10:24 am

    Hi,
    I think this is an incredible project.
    I am trying to build an interactive structure along the same lines, but much different.
    I am an artist by trade, and love programming.

    Can you inform me of current work?

    Also, I am looking for someone to collaborate with.

    Anyone interested?

    Great work, keep it going!

  17. Kathy Riley on May 6, 2008 6:28 pm

    Hi there,

    i’m a writer for Australian Geographic magazine - very interested in reproducing your “Top 20 most hotly revised pages in Wikipedia” list. Can you pls email me ASAP?

    Many thanks
    Kathy
