Prashanth Ellina made a great work extracting topics using wikipedia data. Using the Graphviz program, he shows “…the wealth of information (both as text and as interconnects)” in wikipedia. The graphs are of incomparable complexity and might be not so easy to decode (see an example below, more high quality images in Prashanth’s blog), but remind me of some maps of the web I stumbled upon some time ago (the internet mapping project). It seems to me that even wikipedia obeys to power laws.
It makes me wonder: if some articles in wikipedia get more links than others, can this be considered as an authority breach? Are some articles considered more authoritative or just more popular?
Here is a link he suggests in understanding the interconnection between wikipedia categories.






Stephen North responded on 01 Jan 2008 at 1:59 am #
Hi. I’d suggest at least trying neato or fdp for layouts of these networks, particularly, neato -Gmodel=subset