====== Project 2 ======
Select a network data set from the list below, from another source ([[pajek:ev:pde:sources|sources]], your own network), or take the k-th time slice of [[http://vlado.fmf.uni-lj.si/pub/networks/data/CRA/terror.htm|Reuters terror news]], where k is your number in the students' list. The selected network must have at least 500 **labeled** nodes. To prevent duplication, **e-mail me a reservation**.
^ n ^ student ^ network ^
| 1 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/collab/geom.htm|Collaboration network in computational geometry]] |
| 2 | Budong Dong | [[http://vlado.fmf.uni-lj.si/pub/networks/data/econ/Eva/Eva.htm|EVA - Extraction, Visualization & Analysis of corporate inter-relationships]] |
| 3 | | [[pajek:nets:mix:euroad|Euro roads]] |
| 4 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/dic/odlis/Odlis.htm|ODLIS - Online Dictionary of Library and Information Science]] |
| 5 | Irina Pavlova | [[https://github.com/bavla/Nets/tree/master/data/JCR|Leydesdorff's citations JCR7]] |
| 6 | | [[http://www.mas.ncl.ac.uk/~ncsg3/blog/dependencies.csv|R Dependency Network]] |
| 7 | | [[https://github.com/BuzzFeedNews/trumpworld|TrumpWorld associations (2017)]] |
| 8 | Jiaxuan Wang | [[http://opsahl.co.uk/tnet/datasets/USairport_2010.dl|US airport networks (2010)]]; [[http://opsahl.co.uk/tnet/datasets/USairport_2010_codes.txt|Airport codes]] |
| 9 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/bio/Yeast/Yeast.htm|Protein-protein interaction network in budding yeast]] |
| 10 | | [[http://tuvalu.santafe.edu/~simon/styled-9/styled-10/|Wikipedia's Norm Network]] |
| 11 | | [[https://github.com/bavla/Rnet/tree/master/net/mRel/London|London transport]] |
| 12 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/mix/mixed.htm|polBlogs]] |
| 13 | | [[https://medium.com/swlh/analyzing-references-in-bibles-verses-using-complex-networks-with-pandas-and-gephi-8a4edc52e7ab|Bible references]] / [[https://github.com/edusrmt/one-figure-projects/blob/master/datasets/translated_references.csv|network data]] |
| 14 | | [[https://www.kaggle.com/datasets/simonburton/wikipedia-mathematicians|Wikipedia mathematicians]] |
| 15 | | [[https://www.kaggle.com/datasets/csanhueza/the-marvel-universe-social-network|Marvel]] |
| 17 | | [[pajek:data:pajek:som|SOM - self organizing maps citation network]] |
| 18 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/cite/SciMet.zip|Scientometrics 1978-2011 citation network]], [[http://vlado.fmf.uni-lj.si/pub/networks/data/cite/default.htm|Info]] |
| 19 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/esna/IsleofMan.htm|Isle of Man genealogies]] |
| 20 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data//GED/Presdnts.GED|US presidents genealogies]] |
| 21 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/esna/ragusa.htm|Ragusan nobility genealogy]] |
| 22 | | [[pajek:data:link:az04|Amazon books 2004]] |
| 23 | | [[pajek:data:link:sn5|SN5cite from SN5 - Works on SNA from WoS till 2007]] |
| 24 | | [[http://vlado.fmf.uni-lj.si/pub/networks/data/dic/fa/FreeAssoc.htm|PairsP from FreeAssoc - The USF word association, rhyme, and word fragment norms]] |
| 25 | | [[https://www.kaggle.com/datasets/ruchi798/star-wars|starwars-full]] |
| 26 | | |
[[https://docs.google.com/spreadsheets/d/1hsR2_BUqDh8zEAxTBX1dK5auZm-Mwv-JyEXAI70WsSk/edit#gid=0|Google spreadsheet]]
If the selected network is not in Pajek's format, you first need to convert it ([[ru:hse:rnet:hints|example]]).
For the selected network, using Pajek:
  - determine the basic network characteristics (directed/undirected; loops; multiple links; weights; number of nodes, links, and components; largest degree; diameter; whether it is acyclic; bow-tie composition for directed networks; ...).
  - draw the degree distribution (for a directed network, also the indegree and outdegree distributions). List the top 20 nodes by (in/out) degree.
  - in a directed network, determine:
    - the number of strong components and, if there are many, their size distribution;
    - the condensation and its depth.
  - extract the largest (weak) component of an undirected network, or the largest strong component of a directed network. For it, compute the standard importance measures (degree, betweenness, closeness, corrected clustering coefficient; in a directed network also hubs and authorities). For each measure, determine the top 20 nodes.
  - determine the cores in your network. Extract and draw the largest core with at most 100 nodes.
  - determine some interesting link islands in your network; draw and comment on them. If your network is not weighted, select some measure of link importance ([[pajek:ev:pde:hints|hints]]) and compute the weights. Interpret the results.
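
The analysis itself is to be done in Pajek. As an optional sanity check on two of the steps above (degree distribution and cores), the following sketch recomputes them in plain Python. It assumes the network is given as a list of node pairs and treats it as undirected and simple (loops and multiple links are ignored); the function name ''degrees_and_cores'' and the toy data are hypothetical. The core numbers are obtained by the usual peeling idea: repeatedly remove a node of minimum remaining degree. This simple version is quadratic in the number of nodes, so it is meant for checking, not for large networks.

```python
from collections import Counter, defaultdict


def degrees_and_cores(edges):
    """Return (degree, core) dicts for an undirected simple graph.

    Hypothetical helper for illustration. Nodes that appear in no link
    (isolated nodes) are not present in an edge list and are not reported.
    """
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore loops; sets ignore multiple links
            adj[u].add(v)
            adj[v].add(u)
    degree = {v: len(nb) for v, nb in adj.items()}

    # Peeling: remove a node of minimum remaining degree; its core number
    # is the largest minimum degree seen so far.
    remaining = dict(degree)
    core = {}
    k = 0
    while remaining:
        v = min(remaining, key=remaining.get)
        k = max(k, remaining[v])
        core[v] = k
        del remaining[v]
        for w in adj[v]:
            if w in remaining:
                remaining[w] -= 1
    return degree, core


# Toy network: a triangle a-b-c with a pendant node d attached to c.
toy_edges = [("a", "b"), ("b", "c"), ("c", "a"), ("c", "d")]
degree, core = degrees_and_cores(toy_edges)

distribution = Counter(degree.values())  # degree -> number of nodes
top_nodes = sorted(degree, key=lambda v: (-degree[v], v))[:20]

print(sorted(distribution.items()))
print(top_nodes)
print(core)
```

Comparing these numbers with the Pajek results for the same network is a quick way to catch conversion mistakes, such as links lost or duplicated while building the ''.net'' file.
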
Write a report and attach a ZIP file with your network data.
\\
\\ \\
====== ======
[[ru:hse:nets22:pro|Projects 2022]]