====== JCR ======
https://github.com/bavla/Rnet
https://figshare.com/authors/Loet_Leydesdorff/684994
https://www.leydesdorff.net/
https://documenter.getpostman.com/view/7840038/SzYaVdeo?version=latest
https://www.leydesdorff.net/jcr07/cited/
https://www.aminer.org/citation
https://www.citnetexplorer.nl/
https://harzing.com/resources/publish-or-perish
https://linqs.soe.ucsc.edu/data
https://www.slideshare.net/MasoudMohammadi5/citation-analysis-85720471
https://github.com/rcsb/BioCaddiePilot32/blob/master/src/main/resources/NetworkAnalysis.md
https://www.bibliometrix.org/
https://networkrepository.com/cit.php
===== Combining JCR networks into a single network for each year =====
Combine networks from https://www.leydesdorff.net/jcr07/cited/ into a single network for each year.
At the first sight it seems that the files are numbered with consecutive numbers. But it turned out that this assumption doesn't hold.
Cited 2007:
Aapg Bulletin
Aaps Journal
Aaps Pharmscitech
Aatcc Review
Abacus A Journal Of Accounting Finance And Business Studies
Abdominal Imaging
...
Zuckerindustrie
Zygon
Zygote
I cut-out from the html description of the page https://www.leydesdorff.net/jcr07/cited/ the list of all files and saved it in the file ''list07.txt''. From it I extracted in R the corresponding file names.
> wdir <- "C:/Users/vlado/work2/data/nets/loet"
> setwd(wdir)
> source("Pajek.R")
>
> # https://www.leydesdorff.net/jcr07/cited/v4.txt
>
> cat("Start:",format(Sys.time(), "%H:%M:%S"),"\n")
> durl <- "https://www.leydesdorff.net/jcr07/cited/"
> L <- readLines("list07.txt"); L <- L[tolower(substr(L,1,2))==" S <- unlist(strsplit(L,'"')); F <- tolower(S[3*(1:length(L))-1])
> m <- 0; mmax <- 800000; ner <- 0
> U <- rep(NA,mmax); V <- rep(NA,mmax); W <- rep(NA,mmax)
> N <- c(); N["§{@@@@@@@@}§"] <- 0; j <- 0
> for(f in F){
+ j <- j+1
+ if(j %% 100==0) {cat(j,":",format(Sys.time(), "%H:%M:%S"),"\n"); flush.console()}
+ page <- paste(durl,f,sep='')
+ M <- net2matrix(page,warn=-1)
+ if(is.na(M)){ner <- ner+1
+ cat("\n",j,"error in file",page,"\n"); flush.console(); next
+ }
+ nn <- nrow(M); Nam <- row.names(M)
+ for(u in 1:nn) for(v in u:nn){
+ if(M[u,v]!=0){
+ indu <- N[Nam[u]]; if(is.na(indu)) indu <- N[Nam[u]] <- length(N)
+ indv <- N[Nam[v]]; if(is.na(indv)) indv <- N[Nam[v]] <- length(N)
+ m <- m+1; U[m] <- indu; V[m] <- indv; W[m] <- M[u,v]
+ }
+ }
+ }
> cat("end:",format(Sys.time(), "%H:%M:%S"),"\n",ner," errors\n")
>
> uvLab2net(names(N)[2:length(N)],U[1:m],V[1:m],W[1:m],Net="JCR07.net")
The program was interrupted
Error in file(file, "r") : cannot open the connection
To continue the network construction in the interruption point I entered a slightly changed part of the program (valid also for following interruptions)
> cat("Start:",format(Sys.time(), "%H:%M:%S"),"\n")
> F1 <- F[j:length(F)]; j <- j-1
> for(f in F1){
+ j <- j+1
...
+ }
+ }
> cat("end:",format(Sys.time(), "%H:%M:%S"),"\n",ner," errors\n")
The obtained network ''JCR*.net'' has many multiple links with different weights. In Pajek I transform it into the corresponding simple network ''JCR*s.net'' with the maximum weight on each link.
Network Nodes LinksC LinksS AvDegree
JCR4 7251 463056 109036 30.075
JCR5 7397 549421 120835 32.671
JCR6 7487 543443 119713 31.979
JCR7 7769 552335 124208 31.975
''LinksC'' is the number of links in the network ''JCR*.net'' and ''LinksS'' is the number of links in the network ''JCR*s.net''. ''AvDegree'' is the average degree in the network ''JCR*s.net''.
The Pajek NET files are available at [[https://github.com/bavla/Nets/tree/master/data/JCR|Github/Bavla]].