====== Scopus ======

[[vlado:work:nm:pajek|Converting]] Nataliya's CSV data from Scopus into networks I noticed that the records contain besides the author name also its **Scopus Author Identifier** (and for journals their ISSNs).
  * [[https://service.elsevier.com/app/answers/detail/a_id/11212/|What is the Scopus Author Identifier?]]
  * [[https://tutorials.scopus.com/EN/AuthorDetails/sc_AuthorDetails_textOnly.html|Scopus features on the Author details page]]
  * [[https://rmit.libguides.com/c.php?g=677033&p=6588680#:~:text=Scopus%20Author%20ID%20is%20automatically,%2C%20publications%20and%20co%2Dauthors|LibGuides Scopus Author ID]]; [[https://guides.hsict.library.utoronto.ca/c.php?g=429846&p=2931291|UHN Library and Information Services]]
  * Jeroen Baas, Michiel Schotten, Andrew Plume, Grégoire Côté, Reza Karimi: [[https://direct.mit.edu/qss/article/1/1/377/15571/Scopus-as-a-curated-high-quality-bibliometric-data|Scopus as a curated, high-quality bibliometric data source for academic research in quantitative science studies]]. Quantitative Science Studies (2020) 1 (1): 377–386.
  * Anne-Wil Harzing: [[https://link.springer.com/article/10.1007/s11192-015-1699-y|Health warning: Might contain multiple personalities]]. The problem of homonyms in Thomson Reuters Essential Science Indicators. Scientometrics 105(3):2259-2270.
This provides a (at least partial) solution to the author's (and journal's) identification problem. 

===== Data from Scopus =====

On the computer from the university/research domain (from outside using VPN), we visit the [[https://izum.si/servisi-in-storitve/|IZUM services]] page and click on the option Scopus. In Scopus, we enter the query determining the works of our interest. For example, the query **''k-truss AND ( graph OR network )''**


{{vlado:proj:bib:pics:queryA.jpg?800}}

returns the following result

{{vlado:proj:bib:pics:queryB.jpg?800}}

Selecting all hits (red square) and the option export we get

{{vlado:proj:bib:pics:exportA.jpg?800}}

and afterward selecting the CSV option we get a window for selecting exported fields. We select all options in the first three columns, the **''Include references''** option in the last column, and switch off the option **''Truncate to optimize for Excel''**.
 
{{vlado:proj:bib:pics:exportB.jpg?800}}

Clicking on the button **Export** we start the creation of the CSV file with our hits. In a similar way, we exported the hits also in RIS, BiBTeX, and plain text format. They are all available in the
[[http://vladowiki.fmf.uni-lj.si/lib/exe/fetch.php?media=vlado:proj:bib:data:scopus-truss.zip|scopus-truss.zip]] file.

Up to 20000 hits can be saved at once.

===== To do: Scopus2Pajek =====

Because Scopus provides besides the names of units (authors, journals, works) their IDs it simplifies the identification problem for them. We would need a program ''Scopus2Pajek'' similar to ''WoS2Pajek'' ([[https://github.com/bavla/biblio/tree/master/WoS2Pajek|GitHub/Bavla WoS2Pajek]], [[https://github.com/bavla/biblio/blob/master/WoS2Pajek/WoS2Pajek14.pdf|Slides]], [[https://github.com/bavla/biblio/blob/master/WoS2Pajek/WoS2Pajek15.pyw|Python]]) for converting Scopus data into Pajek network files - a combination of ''WoS2Pajek'' and [[http://vladowiki.fmf.uni-lj.si/doku.php?id=vlado:work:nm:pajek|solutions in R]].

There exists a program [[https://github.com/bavla/biblio/tree/master/Scopus2WoS|Scopus2WoS]] for the conversion of Scopus data in RIS format into WoS format. We selected the RIS format because it is very close to the WoS format. Unfortunately, it was a bad decision - in the RIS file, the IDs are not included. 

In the conversion, we encountered the problem of [[http://vladowiki.fmf.uni-lj.si/doku.php?id=vlado:notes:txt:er:nonascii|replacing nonASCII characters with ASCII characters]].