Cultural Structures of Knowledge from Wikipedia Networks of First Links

Bibliographic Details
Title: Cultural Structures of Knowledge from Wikipedia Networks of First Links
Authors: Gabella, Maxime
Publication Year: 2017
Collection: Computer Science
Physics (Other)
Subject Terms: Physics - Physics and Society, Computer Science - Social and Information Networks
Description: Knowledge is useless without structure. While the classification of knowledge has been an enduring philosophical enterprise, it recently found applications in computer science, notably for artificial intelligence. The availability of large databases allowed for complex ontologies to be built automatically, for example by extracting structured content from Wikipedia. However, this approach is subject to manual categorization decisions made by online editors. Here we show that an implicit classification hierarchy emerges spontaneously on Wikipedia. We study the network of first links between articles, and find that it centers on a core cycle involving concepts of fundamental classifying importance. We argue that this structure is rooted in cultural history. For European languages, articles like Philosophy and Science are central, whereas Human and Earth dominate for East Asian languages. This reflects the differences between ancient Greek thought and Chinese tradition. Our results reveal the powerful influence of culture on the intrinsic architecture of complex data sets.
Comment: 12 pages, 5 figures, 6 tables. Added references. To appear in IEEE Transactions on Network Science and Engineering
Document Type: Working Paper
Access URL: http://arxiv.org/abs/1708.05368
Accession Number: edsarx.1708.05368
Database: arXiv