Cite - bavla/biblio GitHub Wiki

Citation analysis

Cite indegree

Cite.net

==============================================================================
1. C:\Mail.Ru Cloud\ANR HSE\ANR Projects\Scientometrics\Data\WoS2Pajek\Cite.net (484667)
==============================================================================
Number of vertices (n): 484667
----------------------------------------------------------
                                       Arcs          Edges
----------------------------------------------------------
Total number of lines                818538              0
----------------------------------------------------------
Number of loops                          29              0
Number of multiple lines               2847              0
----------------------------------------------------------
Density [loops allowed] = 0.00000348
Average Degree = 3.37773358

Remove loops, multiple lines to single lines. Saved as Cite_simpl

==============================================================================
3. Cite_simpl (484667)
==============================================================================
Number of vertices (n): 484667
----------------------------------------------------------
                                       Arcs          Edges
----------------------------------------------------------
Total number of lines                815666              0
----------------------------------------------------------
Number of loops                           0              0
Number of multiple lines                  0              0
----------------------------------------------------------

Density1 [loops allowed]    = 0.00000347
Density2 [no loops allowed] = 0.00000347
Average Degree = 3.36588214

Indegree vec save as separate file - Cite_Indegree.vec

  • 50 works with largest indegree:
==============================================================================
1. Input Degree of N3 (484667)
==============================================================================
Dimension: 484667
The lowest value:                         0.0000
The highest value:                     2085.0000

Highest values: 

      Rank    Vertex                       Value   Id
--------------------------------------------------------
         1        62                   2085.0000   HIRSCH_J(2005)102:16569
         2       703                    774.0000   SMALL_H(1973)24:265
         3      8023                    772.0000   ZIPF_G(1949):
         4      3691                    616.0000   LOTKA_A(1926)16:317
         5     95847                    572.0000   BRADFORD_M(1976)72:248
         6       717                    557.0000   VANECK_N(2010)84:523
         7      2433                    551.0000   GARFIELD_E(1972)178:471
         8       496                    527.0000   EGGHE_L(2006)69:131
         9      3704                    526.0000   PRITCHAR_A(1969)25:348
        10      5917                    495.0000   GARFIELD_E(1955)122:108
        11      2959                    476.0000   PRICE_D(1963):
        12       682                    458.0000   PRICE_D(1965)149:510
        13      2432                    447.0000   GARFIELD_E(2006)295:90
        14      2357                    439.0000   GARFIELD_E(1979):
        15      2455                    407.0000   SEGLEN_P(1997)314:498
        16       491                    404.0000   CHEN_C(2006)57:359
        17       236                    402.0000   MOED_H(2005):
        18       723                    395.0000   WHITE_H(1998)49:327
        19       722                    380.0000   WHITE_H(1981)32:163
        20      6044                    376.0000   KATZ_J(1997)26:1
        21      5530                    365.0000   WASSERMA_S(1994):
        22      1460                    359.0000   KESSLER_M(1963)14:10
        23      2162                    355.0000   MERTON_R(1968)159:56
        24      1172                    353.0000   FALAGAS_M(2008)22:338
        25      8119                    325.0000   BRADFORD_S(1934)137:85
        26      5644                    301.0000   VANRAAN_A(2006)67:491
        27      4715                    300.0000   NEWMAN_M(2001)98:404
        28        61                    297.0000   HIRSCH_J(2007)104:19193
        29       655                    297.0000   MCCAIN_K(1990)41:433
        30      1915                    296.0000   BORGATTI_S(2002):
        31     25852                    292.0000   BARABASI_A(1999)286:509
        32       318                    284.0000   MEHO_L(2007)58:2105
        33       328                    277.0000   PRICE_D(1976)27:292
        34      6015                    276.0000   NEWMAN_M(2004)101:5200
        35      7305                    274.0000   FREEMAN_L(1979)1:215
        36       170                    259.0000   BORNMANN_L(2008)64:45
        37      5226                    257.0000   ALONSO_S(2009)3:273
        38     36871                    257.0000   NEWMAN_M(2005)46:323
        39      6130                    256.0000   WATTS_D(1998)393:440
        40     36875                    255.0000   SIMON_H(1955)42:425
        41     17065                    253.0000   EGGHE_L(1990):
        42       689                    242.0000   RAMOS-RO_A(2004)25:981
        43      9161                    241.0000   BORNER_K(2003)37:179
        44     17840                    241.0000   JIN_B(2007)52:855
        45      5031                    239.0000   KING_D(2004)430:311
        46       581                    239.0000   CRANE_D(1972):
        47     15666                    230.0000   BRAUN_T(2006)69:169
        48      5441                    227.0000   SMALL_H(1974)4:17
        49      3825                    226.0000   THELWALL_M(2013)8:0064841
        50     15702                    223.0000   MOED_H(1995)33:381
--------------------------------------------------------
Sum (all values):                    815666.0000
1		2085.0000	HIRSCH_J(2005)102:16569	Hirsch, JE	 An index to quantify an individual's scientific research output	P NATL ACAD SCI USA	2005
2		774.0000	SMALL_H(1973)24:265	SMALL, H	 COCITATION IN SCIENTIFIC LITERATURE - NEW MEASURE OF RELATIONSHIP BETWEEN 2 DOCUMENTS	*****	1973
3		772.0000	ZIPF_G(1949):	Zipf, George Kingsley	 Human Behavior And The Principle Of Least Effort	*****	1949
4		616.0000	LOTKA_A(1926)16:317	Lotka, A.J. 	 The Frequency Distribution of Scientific Productivity 	*****	1926
5		572.0000	BRADFORD_M(1976)72:248	BRADFORD, MM	 RAPID AND SENSITIVE METHOD FOR QUANTITATION OF MICROGRAM QUANTITIES OF PROTEIN UTILIZING PRINCIPLE OF PROTEIN-DYE BINDING	*****	1976
6		557.0000	VANECK_N(2010)84:523	van Eck, NJ	 Software survey: VOSviewer, a computer program for bibliometric mapping	SCIENTOMETRICS	2010
7		551.0000	GARFIELD_E(1972)178:471	GARFIELD, E	 CITATION ANALYSIS AS A TOOL IN JOURNAL EVALUATION - JOURNALS CAN BE RANKED BY FREQUENCY AND IMPACT OF CITATIONS FOR SCIENCE POLICY STUDIES	*****	1972
8		527.0000	EGGHE_L(2006)69:131	Egghe, L	 Theory and practise of the g-index	SCIENTOMETRICS	2006
9		526.0000	PRITCHAR_A(1969)25:348	PRITCHARD, A	 STATISTICAL BIBLIOGRAPHY OR BIBLIOMETRICS	J DOC	1969
10		495.0000	GARFIELD_E(1955)122:108	GARFIELD, E	 CITATION INDEXES FOR SCIENCE - NEW DIMENSION IN DOCUMENTATION THROUGH ASSOCIATION OF IDEAS	*****	1955
11		476.0000	PRICE_D(1963):	Price, Derek J. 	 Little Science, Big Science and Beyond	*****	1963
12		458.0000	PRICE_D(1965)149:510	PRICE, DJD	 NETWORKS OF SCIENTIFIC PAPERS	*****	1965
13		447.0000	GARFIELD_E(2006)295:90	Garfield, E	 The history and meaning of the journal impact factor	*****	2006
14		439.0000	GARFIELD_E(1979):	Garfield, Eugene  	 Citation Indexing - Its Theory and Application in Science, Technology, and Humanities 	*****	1979
15		407.0000	SEGLEN_P(1997)314:498	Seglen, PO	 Why the impact factor of journals should not be used for evaluating research	*****	1997
16		404.0000	CHEN_C(2006)57:359	Chen, CM	 CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature	*****	2006
17		402.0000	MOED_H(2005):	Moed, Henk F.	 Citation Analysis in Research Evaluation	*****	2005
18		395.0000	WHITE_H(1998)49:327	White, HD	 Visualizing a discipline: An author co-citation analysis of information science, 1972-1995	J AM SOC INFORM SCI	1998
19		380.0000	WHITE_H(1981)32:163	WHITE, HD	 AUTHOR COCITATION - A LITERATURE MEASURE OF INTELLECTUAL STRUCTURE	*****	1981
20		376.0000	KATZ_J(1997)26:1	Katz, JS	 What is research collaboration?	*****	1997
21		365.0000	WASSERMA_S(1994):	Wasserman, S.	 Social Network Analysis: Methods and Applications	*****	1994
22		359.0000	KESSLER_M(1963)14:10	KESSLER, MM	 BIBLIOGRAPHIC COUPLING BETWEEN SCIENTIFIC PAPERS	AM DOC	1963
23		355.0000	MERTON_R(1968)159:56	MERTON, RK	 MATTHEW EFFECT IN SCIENCE	*****	1968
24		353.0000	FALAGAS_M(2008)22:338	Falagas, ME	 Comparison of PubMed, Scopus, Web of Science, and Google Scholar: strengths and weaknesses	*****	2008
25		325.0000	BRADFORD_S(1934)137:85	Bradford, S.C.	 Sources of information on specific subjects 	*****	1934
26		301.0000	VANRAAN_A(2006)67:491	Van Raan, AFJ	 Comparison of the Hirsch-index with standard bibliometric indicators and with peer judgment for 147 chemistry research groups	SCIENTOMETRICS	2006
27		300.0000	NEWMAN_M(2001)98:404	Newman, MEJ	 The structure of scientific collaboration networks	*****	2001
28		297.0000	HIRSCH_J(2007)104:19193	Hirsch, JE	 Does the h index have predictive power?	P NATL ACAD SCI USA	2007
29		297.0000	MCCAIN_K(1990)41:433	MCCAIN, KW	 MAPPING AUTHORS IN INTELLECTUAL SPACE - A TECHNICAL OVERVIEW	*****	1990
30		296.0000	BORGATTI_S(2002):	Borgatti, S. P.	 Ucinet 6 for Windows: Software for Social Network Analysis 	*****	2002
31		292.0000	BARABASI_A(1999)286:509	Barabasi, AL	 Emergence of scaling in random networks	*****	1999
32		284.0000	MEHO_L(2007)58:2105	Meho, LI	 Impact of data sources on citation counts and rankings of LIS faculty: Web of science versus scopus and google scholar	J AM SOC INF SCI TEC	2007
33		277.0000	PRICE_D(1976)27:292	PRICE, DJD	 GENERAL THEORY OF BIBLIOMETRIC AND OTHER CUMULATIVE ADVANTAGE PROCESSES	J AM SOC INFORM SCI	1976
34		276.0000	NEWMAN_M(2004)101:5200	Newman, MEJ	 Coauthorship networks and patterns of scientific collaboration	P NATL ACAD SCI USA	2004
35		274.0000	FREEMAN_L(1979)1:215	FREEMAN, LC	 CENTRALITY IN SOCIAL NETWORKS CONCEPTUAL CLARIFICATION	*****	1979
36		259.0000	BORNMANN_L(2008)64:45	Bornmann, L	 What do citation counts measure? A review of studies on citing behavior	J DOC	2008
37		257.0000	ALONSO_S(2009)3:273	Alonso, S	 h-Index: A review focused in its variants, computation and standardization for different scientific fields	J INFORMETR	2009
38		257.0000	NEWMAN_M(2005)46:323	Newman, MEJ	 Power laws, Pareto distributions and Zipf's law	CONTEMP PHYS	2005
39		256.0000	WATTS_D(1998)393:440	Watts, DJ	 Collective dynamics of 'small-world' networks	*****	1998
40		255.0000	SIMON_H(1955)42:425	SIMON, HA	 ON A CLASS OF SKEW DISTRIBUTION FUNCTIONS	*****	1955
41		253.0000	EGGHE_L(1990):	Egghe, Leo 	 Introduction to Informetrics : quantitative methods in library, documentation and information science	*****	1990
42		242.0000	RAMOS-RO_A(2004)25:981	Ramos-Rodriguez, AR	 Changes in the intellectual structure of strategic management research: A bibliometric study of the Strategic Management Journal, 1980-2000	STRATEGIC MANAGE J	2004
43		241.0000	BORNER_K(2003)37:179	Borner, K	 Visualizing knowledge domains	ANNU REV INFORM SCI	2003
44		241.0000	JIN_B(2007)52:855	Jin, BH	 The R- and AR-indices: Complementing the h-index	CHINESE SCI BULL	2007
45		239.0000	KING_D(2004)430:311	King, DA	 The scientific impact of nations 	*****	2004
46		239.0000	CRANE_D(1972):	Crane, Diana  	 Invisible Colleges: Diffusion of Knowledge in Scientific Communities	*****	1972
47		230.0000	BRAUN_T(2006)69:169	Braun, T	 A Hirsch-type index for journals	SCIENTOMETRICS	2006
48		227.0000	SMALL_H(1974)4:17	SMALL, H	 STRUCTURE OF SCIENTIFIC LITERATURES .1. IDENTIFYING AND GRAPHING SPECIALTIES	*****	1974
49		226.0000	THELWALL_M(2013)8:0064841	Thelwall, M	 Do Altmetrics Work? Twitter and Ten Other Social Web Services	PLOS ONE	2013
50		223.0000	MOED_H(1995)33:381	MOED, HF	 NEW BIBLIOMETRIC TOOLS FOR THE ASSESSMENT OF NATIONAL RESEARCH PERFORMANCE - DATABASE DESCRIPTION, OVERVIEW OF INDICATORS AND FIRST APPLICATIONS	SCIENTOMETRICS	1995
==============================================================================
1. Input Degree Partition of N3 (484667)
==============================================================================
Dimension: 484667
The lowest value:     0
The highest value: 2085

Frequency distribution of cluster values:

   Cluster      Freq     Freq%   CumFreq  CumFreq% Representative
 ----------------------------------------------------------------
         0     12046    2.4854     12046    2.4854 SAID_H(2018)42:2507
         1    393496   81.1889    405542   83.6744 ABDELAAL_A(2016)2:2015037
         2     40958    8.4508    446500   92.1251 FLEISCHM_W(2016)68:153
         3     13304    2.7450    459804   94.8701 CARLSON_J(2011)72:167
         4      6473    1.3356    466277   96.2056 RATHI_V(2015)152:993
         5      3904    0.8055    470181   97.0111 INTERNAT_C(2017):
         6      2579    0.5321    472760   97.5433 MINISTRY_O(2017):
         7      1845    0.3807    474605   97.9239 CAMPBELL_E(2007)356:1742
         8      1415    0.2920    476020   98.2159 AUCKLAND_M(2012):
         9      1145    0.2362    477165   98.4521 ZHOU_P(2014)99:695
        10       855    0.1764    478020   98.6285 BEHRENS_H(2011)86:179

Boundary network

DC.clu

==============================================================================
2. C:\Mail.Ru Cloud\ANR HSE\ANR Projects\Scientometrics\Data\WoS2Pajek\DC.clu (484667)
==============================================================================
Dimension: 484667
The lowest value:  0
The highest value: 7

Frequency distribution of cluster values:

   Cluster      Freq     Freq%   CumFreq  CumFreq% Representative
 ----------------------------------------------------------------
         0    461371   95.1934    461371   95.1934 ABDELAAL_A(2016)2:2015037
         1     10582    2.1834    471953   97.3768 SAID_H(2018)42:2507
         2      2039    0.4207    473992   97.7975 HIRSCH_J(2005)102:16569
         3       894    0.1845    474886   97.9819 GUMPENBE_C(2012)33:174
         4      8069    1.6649    482955   99.6468 AYAZ_S(2016)109:1511
         5      1186    0.2447    484141   99.8915 CHENG_T(2013)52:1630
         6       283    0.0584    484424   99.9499 VANECK_N(2009)60:1635
         7       243    0.0501    484667  100.0000 HOU_J(2018)115:869
 ----------------------------------------------------------------
       Sum    484667  100.0000

Binarize partition

==============================================================================
3. Binarized C2 [1-*] (484667)
==============================================================================
Dimension: 484667
The lowest value:  0
The highest value: 1

Frequency distribution of cluster values:

   Cluster      Freq     Freq%   CumFreq  CumFreq% Representative
 ----------------------------------------------------------------
         0    461371   95.1934    461371   95.1934 ABDELAAL_A(2016)2:2015037
         1     23296    4.8066    484667  100.0000 CHENG_T(2013)52:1630
 ----------------------------------------------------------------
       Sum    484667  100.0000

DC_bin.clu

We want to take hits + works cited more than 3 times Indegree partition:
Partition - Binarize Partition - [4-*]

==============================================================================
7. Binarized C1 [4-*] (484667)
==============================================================================
Dimension: 484667
The lowest value:  0
The highest value: 1

Frequency distribution of cluster values:

   Cluster      Freq     Freq%   CumFreq  CumFreq% Representative
 ----------------------------------------------------------------
         0    459804   94.8701    459804   94.8701 ABDELAAL_A(2016)2:2015037
         1     24863    5.1299    484667  100.0000 CHENG_T(2013)52:1630
 ----------------------------------------------------------------
       Sum    484667  100.0000

Partition DC Partition Indegree Partitions - Max

==============================================================================
8. Max of C3 and C7 (484667)
==============================================================================
Dimension: 484667
The lowest value:  0
The highest value: 1

Frequency distribution of cluster values:

   Cluster      Freq     Freq%   CumFreq  CumFreq% Representative
 ----------------------------------------------------------------
         0    442416   91.2825    442416   91.2825 ABDELAAL_A(2016)2:2015037
         1     42251    8.7175    484667  100.0000 CHENG_T(2013)52:1630
 ----------------------------------------------------------------
       Sum    484667  100.0000

Intersection of 23296 (DC.clu) and 24863 (Indegree partition) is 42251

Operations - Network + Partition
Extract Subnetwork
CiteB

==============================================================================
4. Cite_Bound (42251)
==============================================================================
Number of vertices (n): 42251
----------------------------------------------------------
                                       Arcs          Edges
----------------------------------------------------------
Total number of lines                309844              0
----------------------------------------------------------
Number of loops                           0              0
Number of multiple lines                  0              0
----------------------------------------------------------

Density1 [loops allowed]    = 0.00017357
Density2 [no loops allowed] = 0.00017357
Average Degree = 14.66682445

SPC weights on network

Strong components

==============================================================================
11. Strong Components of N4 [>=2] (42251, comp.=15)
==============================================================================
Dimension: 42251
The lowest value:   0
The highest value: 15

Frequency distribution of cluster values:

   Cluster      Freq     Freq%   CumFreq  CumFreq% Representative
 ----------------------------------------------------------------
         0     42218   99.9219     42218   99.9219 CHENG_T(2013)52:1630
         1         3    0.0071     42221   99.9290 VELDEN_T(2017)111:1169
         2         2    0.0047     42223   99.9337 ZHANG_Y(2016)105:179
         3         2    0.0047     42225   99.9385 LANDSTRO_H(2012)41:1154
         4         2    0.0047     42227   99.9432 PONCE_F(2010)112:223
         5         2    0.0047     42229   99.9479 PACKER_A(2006)78:841
         6         3    0.0071     42232   99.9550 HOLDEN_G(2005)41:1
         7         2    0.0047     42234   99.9598 GARCIA-P_M(2009)81:779
         8         2    0.0047     42236   99.9645 SCHUMMER_J(1997)39:125
         9         2    0.0047     42238   99.9692 KONUR_O(2012)29:323
        10         2    0.0047     42240   99.9740 SMITH_D(2009)61:194
        11         2    0.0047     42242   99.9787 SCHAER_P(2013)38:282
        12         3    0.0071     42245   99.9858 KONUR_O(2012)4:1603
        13         2    0.0047     42247   99.9905 KONUR_O(2012)4:1935
        14         2    0.0047     42249   99.9953 WORMELL_I(2000)48:237
        15         2    0.0047     42251  100.0000 LI_X(2005)64:151
 ----------------------------------------------------------------
       Sum     42251  100.0000

Cite_Strong comp (saved)

Preprint transformation

==============================================================================
6. Preprint Transformation of N4 (42284)
==============================================================================
Number of vertices (n): 42284
----------------------------------------------------------
                                       Arcs          Edges
----------------------------------------------------------
Total number of lines                309878              0
----------------------------------------------------------
Number of loops                           0              0
Number of multiple lines                  0              0
----------------------------------------------------------

Density1 [loops allowed]    = 0.00017332
Density2 [no loops allowed] = 0.00017332
Average Degree = 14.65698609

Search path count

==============================================================================
7. Citation weights SPC [flow] of N6 (42284)
==============================================================================
Number of vertices (n): 42284
----------------------------------------------------------
                                       Arcs          Edges
----------------------------------------------------------
Number of lines with value=1              0              0
Number of lines with value#1         309878              0
----------------------------------------------------------
Total number of lines                309878              0
----------------------------------------------------------
Number of loops                           0              0
Number of multiple lines                  0              0
----------------------------------------------------------
Density1 [loops allowed]    = 0.00017332
Density2 [no loops allowed] = 0.00017332
Average Degree = 14.65698609

Cite_SPC weights (saved)

Main path

Main Paths - Global Search - Standart 63 nodes
Cite_Main path + titles (saved)

Key Routes

Main Paths - Global Search - Key Routes [1-10] = 63 nodes (same)
Main Paths - Global Search - Key Routes [1-30] = 93 nodes Cite_Key routes + titles (saved)

Islands

Islands - Generate networks with islands
Islands - Line weights [10, 200]

   Cluster      Freq     Freq%   CumFreq  CumFreq% Representative
 ----------------------------------------------------------------
         0     41735   98.7016     41735   98.7016         1
         1        11    0.0260     41746   98.7277     37442
         2        13    0.0307     41759   98.7584     42067
         3        14    0.0331     41773   98.7915     41525
         4        16    0.0378     41789   98.8293     41552
         5        95    0.2247     41884   99.0540     20751
         6        11    0.0260     41895   99.0800     35972
         7        20    0.0473     41915   99.1273     21143
         8        43    0.1017     41958   99.2290     37854
         9        30    0.0709     41988   99.3000     31210
        10        14    0.0331     42002   99.3331     35939
        11        23    0.0544     42025   99.3875     36061
        12        15    0.0355     42040   99.4229      3908
        13        10    0.0236     42050   99.4466     35637
        14        11    0.0260     42061   99.4726      1713
        15        12    0.0284     42073   99.5010      4355
        16        11    0.0260     42084   99.5270      1776
        17       200    0.4730     42284  100.0000         5
 ----------------------------------------------------------------
       Sum     42284  100.0000

Extracted all and looked at them
1, 2, 3 - Works citing 1 work (in each island - separate) of Bradford
4 - more interesting structure for 16 nodes - titles
5 - 95 nodes, can extract the titles
6 - also 2 works of Bradford, but also others
7 - works of Caraveo citing himself and others
8 - Zhang, Zhao, Song - Corean group (43 nodes)
9 - A lot of Mumford
10 - 14 nodes, can look at the titles
11 - 23 nodes, can look at the titles
12 - 15 nodes, not ineteresting, a lot of cited
13 - not interesting, a lot of Eliazar
14 - a lot of Konur, not interesting
15 - not interesting
16 - Svider, Eloy,... not ineteresting
17 - 200 nodes: too messy to draw, but we can extract the titles and compare with the MP and KR

Numbers for different sources in the Final table

j am soc inform sci	39
j informetr	33
scientometrics	29
j inform sci	10
j doc	15
annu rev inform sci	7
inform process manag	7
soc stud sci	6
libr trends	5
aslib j inform manag	4
book	4
czech j phys	3
online inform rev	3
prof inform	3
asist mon ser	2
biometrika	2
embo rep	2
front neurosci-switz	2
libr quart	2
libri	2
pro int conf sci inf	2
am sociol rev	1
anim sci pap rep	1
chinese sci bull	1
communication theory	1
curr contents	1
decis support syst	1
econ j	1
econometrica	1
eur j org chem	1
front hum neurosci	1
front mol neurosci	1
front pharmacol	1
harvard rev psychiat	1
j data info sci	1
libr inform sci res	1
manage sci	1
math comput model	1
mis quart	1
molecules	1
nature	1
oper res quart	1
p am soc inform sci	1
p natl acad sci usa	1
peerj comput sci	1
phil trans r soc b 	1
plos one	1
res evaluat	1
science	1
soc indic res	1
soc sci inform	1

sum 212 

%

j informetr	15,6
scientometrics	13,7
j am soc inform sci	18,4
j inform sci	4,7
j doc	7,1
annu rev inform sci	3,3
inform process manag	3,3
soc stud sci	2,8
libr trends	2,4
aslib j inform manag	1,9
book	1,9
czech j phys	1,4
online inform rev	1,4
prof inform	1,4
asist mon ser	0,9
biometrika	0,9
embo rep	0,9
front neurosci-switz	0,9
libr quart	0,9
libri	0,9
pro int conf sci inf	0,9
other	14,2 (30)