====== Project 2 ====== Select a network data set from the following list or from somewhere else ([[pajek:ev:pde:sources|sources]], your own network) or the k-th time slice from [[http://vlado.fmf.uni-lj.si/pub/networks/data/CRA/terror.htm|Reuters terror news]] where k is your number from the students' list. The selected network should have at least 500 **labeled** nodes. To prevent duplication, **send a reservation to my e-mail**. ^ n ^ student ^ network ^ | 1 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/collab/geom.htm|Collaboration network in computational geometry]] | | 2 | Budong Dong | [[http://vlado.fmf.uni-lj.si/pub/networks/data/econ/Eva/Eva.htm|EVA - Extraction, Visualization & Analysis of corporate inter-relationships]] | | 3 | | [[pajek:nets:mix:euroad|Euro roads]] | | 4 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/dic/odlis/Odlis.htm|ODLIS - Online Dictionary of Library and Information Science]] | | 5 | Irina Pavlova | [[https://github.com/bavla/Nets/tree/master/data/JCR|Leydesdorff's citations JCR7]] | | 6 | | [[http://www.mas.ncl.ac.uk/~ncsg3/blog/dependencies.csv|R Dependency Network]] | | 7 | | [[https://github.com/BuzzFeedNews/trumpworld|TrumpWorld associations (2017)]] | | 8 | Jiaxuan Wang | [[http://opsahl.co.uk/tnet/datasets/USairport_2010.dl|US airport networks (2010)]]; [[http://opsahl.co.uk/tnet/datasets/USairport_2010_codes.txt|Airport codes]] | | 9 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/bio/Yeast/Yeast.htm|Protein-protein interaction network in budding yeast]] | | 10 | | [[http://tuvalu.santafe.edu/~simon/styled-9/styled-10/|Wikipedia's Norm Network]] | | 11 | | [[https://github.com/bavla/Rnet/tree/master/net/mRel/London|London transport]] | | 12 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/mix/mixed.htm|polBlogs]] | | 13 | | [[https://medium.com/swlh/analyzing-references-in-bibles-verses-using-complex-networks-with-pandas-and-gephi-8a4edc52e7ab|Bible references]] / [[https://github.com/edusrmt/one-figure-projects/blob/master/datasets/translated_references.csv|network data]] | | 14 | | [[https://www.kaggle.com/datasets/simonburton/wikipedia-mathematicians|Wikipedia mathematicians]] | | 15 | | [[https://www.kaggle.com/datasets/csanhueza/the-marvel-universe-social-network|Marvel]] | | 17 | | [[pajek:data:pajek:som|SOM - self organizing maps citation network]] | | 18 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/cite/SciMet.zip|Scientometrics 1978-2011 citation network]], [[http://vlado.fmf.uni-lj.si/pub/networks/data/cite/default.htm|Info]] | | 19 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/esna/IsleofMan.htm|Isle of Man genealogies]] | | 20 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data//GED/Presdnts.GED|US presidents genealogies]] | | 21 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/esna/ragusa.htm|Ragusan nobility genealogy]] | | 22 | | [[pajek:data:link:az04|Amazon books 2004]] | | 23 | | [[pajek:data:link:sn5|SN5cite from SN5 - Works on SNA from WoS till 2007]] | | 24 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/dic/fa/FreeAssoc.htm|PairsP from FreeAssoc - The USF word association, rhyme, and word fragment norms]] | | 25 | | [[https://www.kaggle.com/datasets/ruchi798/star-wars|starwars-full]] | | 26 | | [[|]] | [[https://docs.google.com/spreadsheets/d/1hsR2_BUqDh8zEAxTBX1dK5auZm-Mwv-JyEXAI70WsSk/edit#gid=0|Google spreadsheet]] If the selected network is not in Pajek's format you need first to convert it ([[ru:hse:rnet:hints|example]]). For the selected network, using Pajek: - determine basic network characteristics (directed/undirected, loops, multiple links; weights?; the number of nodes, number of links, number of components; largest degree, diameter, acyclic?, bow-tie composition for directed, ...). - draw the degree (in directed also indegree and outdegree) distribution. List the top 20 nodes of the largest (in/out) degree. - in a directed network: - number of strong components; if many, their size distribution; - condensation; depth of condensation - in an undirected network the largest (weak) component / in a directed network extract the largest strong component. For it compute the standard importance measures (degree, betweenness, closeness, corrected clustering coefficient; and in a directed network also hubs and authorities). For each measure determine the top 20 nodes. - determine the cores in your network. Extract and draw the largest core with at most 100 nodes. - determine some interesting link islands of your network, draw and comment on them. If your network is not weighted select some measure of the importance of links ([[pajek:ev:pde:hints|hints]]) and compute the weights. Interpret the results. Write a report. Attach also a ZIP with your network data. \\ \\ \\ ====== ====== [[ru:hse:nets22:pro|Projects 2022]]