Cite - bavla/biblio GitHub Wiki
Citation analysis
Cite indegree
Cite.net
==============================================================================
1. C:\Mail.Ru Cloud\ANR HSE\ANR Projects\Scientometrics\Data\WoS2Pajek\Cite.net (484667)
==============================================================================
Number of vertices (n): 484667
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Total number of lines 818538 0
----------------------------------------------------------
Number of loops 29 0
Number of multiple lines 2847 0
----------------------------------------------------------
Density [loops allowed] = 0.00000348
Average Degree = 3.37773358
Remove loops, multiple lines to single lines. Saved as Cite_simpl
==============================================================================
3. Cite_simpl (484667)
==============================================================================
Number of vertices (n): 484667
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Total number of lines 815666 0
----------------------------------------------------------
Number of loops 0 0
Number of multiple lines 0 0
----------------------------------------------------------
Density1 [loops allowed] = 0.00000347
Density2 [no loops allowed] = 0.00000347
Average Degree = 3.36588214
Indegree vec save as separate file - Cite_Indegree.vec
- 50 works with largest indegree:
==============================================================================
1. Input Degree of N3 (484667)
==============================================================================
Dimension: 484667
The lowest value: 0.0000
The highest value: 2085.0000
Highest values:
Rank Vertex Value Id
--------------------------------------------------------
1 62 2085.0000 HIRSCH_J(2005)102:16569
2 703 774.0000 SMALL_H(1973)24:265
3 8023 772.0000 ZIPF_G(1949):
4 3691 616.0000 LOTKA_A(1926)16:317
5 95847 572.0000 BRADFORD_M(1976)72:248
6 717 557.0000 VANECK_N(2010)84:523
7 2433 551.0000 GARFIELD_E(1972)178:471
8 496 527.0000 EGGHE_L(2006)69:131
9 3704 526.0000 PRITCHAR_A(1969)25:348
10 5917 495.0000 GARFIELD_E(1955)122:108
11 2959 476.0000 PRICE_D(1963):
12 682 458.0000 PRICE_D(1965)149:510
13 2432 447.0000 GARFIELD_E(2006)295:90
14 2357 439.0000 GARFIELD_E(1979):
15 2455 407.0000 SEGLEN_P(1997)314:498
16 491 404.0000 CHEN_C(2006)57:359
17 236 402.0000 MOED_H(2005):
18 723 395.0000 WHITE_H(1998)49:327
19 722 380.0000 WHITE_H(1981)32:163
20 6044 376.0000 KATZ_J(1997)26:1
21 5530 365.0000 WASSERMA_S(1994):
22 1460 359.0000 KESSLER_M(1963)14:10
23 2162 355.0000 MERTON_R(1968)159:56
24 1172 353.0000 FALAGAS_M(2008)22:338
25 8119 325.0000 BRADFORD_S(1934)137:85
26 5644 301.0000 VANRAAN_A(2006)67:491
27 4715 300.0000 NEWMAN_M(2001)98:404
28 61 297.0000 HIRSCH_J(2007)104:19193
29 655 297.0000 MCCAIN_K(1990)41:433
30 1915 296.0000 BORGATTI_S(2002):
31 25852 292.0000 BARABASI_A(1999)286:509
32 318 284.0000 MEHO_L(2007)58:2105
33 328 277.0000 PRICE_D(1976)27:292
34 6015 276.0000 NEWMAN_M(2004)101:5200
35 7305 274.0000 FREEMAN_L(1979)1:215
36 170 259.0000 BORNMANN_L(2008)64:45
37 5226 257.0000 ALONSO_S(2009)3:273
38 36871 257.0000 NEWMAN_M(2005)46:323
39 6130 256.0000 WATTS_D(1998)393:440
40 36875 255.0000 SIMON_H(1955)42:425
41 17065 253.0000 EGGHE_L(1990):
42 689 242.0000 RAMOS-RO_A(2004)25:981
43 9161 241.0000 BORNER_K(2003)37:179
44 17840 241.0000 JIN_B(2007)52:855
45 5031 239.0000 KING_D(2004)430:311
46 581 239.0000 CRANE_D(1972):
47 15666 230.0000 BRAUN_T(2006)69:169
48 5441 227.0000 SMALL_H(1974)4:17
49 3825 226.0000 THELWALL_M(2013)8:0064841
50 15702 223.0000 MOED_H(1995)33:381
--------------------------------------------------------
Sum (all values): 815666.0000
1 2085.0000 HIRSCH_J(2005)102:16569 Hirsch, JE An index to quantify an individual's scientific research output P NATL ACAD SCI USA 2005
2 774.0000 SMALL_H(1973)24:265 SMALL, H COCITATION IN SCIENTIFIC LITERATURE - NEW MEASURE OF RELATIONSHIP BETWEEN 2 DOCUMENTS ***** 1973
3 772.0000 ZIPF_G(1949): Zipf, George Kingsley Human Behavior And The Principle Of Least Effort ***** 1949
4 616.0000 LOTKA_A(1926)16:317 Lotka, A.J. The Frequency Distribution of Scientific Productivity ***** 1926
5 572.0000 BRADFORD_M(1976)72:248 BRADFORD, MM RAPID AND SENSITIVE METHOD FOR QUANTITATION OF MICROGRAM QUANTITIES OF PROTEIN UTILIZING PRINCIPLE OF PROTEIN-DYE BINDING ***** 1976
6 557.0000 VANECK_N(2010)84:523 van Eck, NJ Software survey: VOSviewer, a computer program for bibliometric mapping SCIENTOMETRICS 2010
7 551.0000 GARFIELD_E(1972)178:471 GARFIELD, E CITATION ANALYSIS AS A TOOL IN JOURNAL EVALUATION - JOURNALS CAN BE RANKED BY FREQUENCY AND IMPACT OF CITATIONS FOR SCIENCE POLICY STUDIES ***** 1972
8 527.0000 EGGHE_L(2006)69:131 Egghe, L Theory and practise of the g-index SCIENTOMETRICS 2006
9 526.0000 PRITCHAR_A(1969)25:348 PRITCHARD, A STATISTICAL BIBLIOGRAPHY OR BIBLIOMETRICS J DOC 1969
10 495.0000 GARFIELD_E(1955)122:108 GARFIELD, E CITATION INDEXES FOR SCIENCE - NEW DIMENSION IN DOCUMENTATION THROUGH ASSOCIATION OF IDEAS ***** 1955
11 476.0000 PRICE_D(1963): Price, Derek J. Little Science, Big Science and Beyond ***** 1963
12 458.0000 PRICE_D(1965)149:510 PRICE, DJD NETWORKS OF SCIENTIFIC PAPERS ***** 1965
13 447.0000 GARFIELD_E(2006)295:90 Garfield, E The history and meaning of the journal impact factor ***** 2006
14 439.0000 GARFIELD_E(1979): Garfield, Eugene Citation Indexing - Its Theory and Application in Science, Technology, and Humanities ***** 1979
15 407.0000 SEGLEN_P(1997)314:498 Seglen, PO Why the impact factor of journals should not be used for evaluating research ***** 1997
16 404.0000 CHEN_C(2006)57:359 Chen, CM CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature ***** 2006
17 402.0000 MOED_H(2005): Moed, Henk F. Citation Analysis in Research Evaluation ***** 2005
18 395.0000 WHITE_H(1998)49:327 White, HD Visualizing a discipline: An author co-citation analysis of information science, 1972-1995 J AM SOC INFORM SCI 1998
19 380.0000 WHITE_H(1981)32:163 WHITE, HD AUTHOR COCITATION - A LITERATURE MEASURE OF INTELLECTUAL STRUCTURE ***** 1981
20 376.0000 KATZ_J(1997)26:1 Katz, JS What is research collaboration? ***** 1997
21 365.0000 WASSERMA_S(1994): Wasserman, S. Social Network Analysis: Methods and Applications ***** 1994
22 359.0000 KESSLER_M(1963)14:10 KESSLER, MM BIBLIOGRAPHIC COUPLING BETWEEN SCIENTIFIC PAPERS AM DOC 1963
23 355.0000 MERTON_R(1968)159:56 MERTON, RK MATTHEW EFFECT IN SCIENCE ***** 1968
24 353.0000 FALAGAS_M(2008)22:338 Falagas, ME Comparison of PubMed, Scopus, Web of Science, and Google Scholar: strengths and weaknesses ***** 2008
25 325.0000 BRADFORD_S(1934)137:85 Bradford, S.C. Sources of information on specific subjects ***** 1934
26 301.0000 VANRAAN_A(2006)67:491 Van Raan, AFJ Comparison of the Hirsch-index with standard bibliometric indicators and with peer judgment for 147 chemistry research groups SCIENTOMETRICS 2006
27 300.0000 NEWMAN_M(2001)98:404 Newman, MEJ The structure of scientific collaboration networks ***** 2001
28 297.0000 HIRSCH_J(2007)104:19193 Hirsch, JE Does the h index have predictive power? P NATL ACAD SCI USA 2007
29 297.0000 MCCAIN_K(1990)41:433 MCCAIN, KW MAPPING AUTHORS IN INTELLECTUAL SPACE - A TECHNICAL OVERVIEW ***** 1990
30 296.0000 BORGATTI_S(2002): Borgatti, S. P. Ucinet 6 for Windows: Software for Social Network Analysis ***** 2002
31 292.0000 BARABASI_A(1999)286:509 Barabasi, AL Emergence of scaling in random networks ***** 1999
32 284.0000 MEHO_L(2007)58:2105 Meho, LI Impact of data sources on citation counts and rankings of LIS faculty: Web of science versus scopus and google scholar J AM SOC INF SCI TEC 2007
33 277.0000 PRICE_D(1976)27:292 PRICE, DJD GENERAL THEORY OF BIBLIOMETRIC AND OTHER CUMULATIVE ADVANTAGE PROCESSES J AM SOC INFORM SCI 1976
34 276.0000 NEWMAN_M(2004)101:5200 Newman, MEJ Coauthorship networks and patterns of scientific collaboration P NATL ACAD SCI USA 2004
35 274.0000 FREEMAN_L(1979)1:215 FREEMAN, LC CENTRALITY IN SOCIAL NETWORKS CONCEPTUAL CLARIFICATION ***** 1979
36 259.0000 BORNMANN_L(2008)64:45 Bornmann, L What do citation counts measure? A review of studies on citing behavior J DOC 2008
37 257.0000 ALONSO_S(2009)3:273 Alonso, S h-Index: A review focused in its variants, computation and standardization for different scientific fields J INFORMETR 2009
38 257.0000 NEWMAN_M(2005)46:323 Newman, MEJ Power laws, Pareto distributions and Zipf's law CONTEMP PHYS 2005
39 256.0000 WATTS_D(1998)393:440 Watts, DJ Collective dynamics of 'small-world' networks ***** 1998
40 255.0000 SIMON_H(1955)42:425 SIMON, HA ON A CLASS OF SKEW DISTRIBUTION FUNCTIONS ***** 1955
41 253.0000 EGGHE_L(1990): Egghe, Leo Introduction to Informetrics : quantitative methods in library, documentation and information science ***** 1990
42 242.0000 RAMOS-RO_A(2004)25:981 Ramos-Rodriguez, AR Changes in the intellectual structure of strategic management research: A bibliometric study of the Strategic Management Journal, 1980-2000 STRATEGIC MANAGE J 2004
43 241.0000 BORNER_K(2003)37:179 Borner, K Visualizing knowledge domains ANNU REV INFORM SCI 2003
44 241.0000 JIN_B(2007)52:855 Jin, BH The R- and AR-indices: Complementing the h-index CHINESE SCI BULL 2007
45 239.0000 KING_D(2004)430:311 King, DA The scientific impact of nations ***** 2004
46 239.0000 CRANE_D(1972): Crane, Diana Invisible Colleges: Diffusion of Knowledge in Scientific Communities ***** 1972
47 230.0000 BRAUN_T(2006)69:169 Braun, T A Hirsch-type index for journals SCIENTOMETRICS 2006
48 227.0000 SMALL_H(1974)4:17 SMALL, H STRUCTURE OF SCIENTIFIC LITERATURES .1. IDENTIFYING AND GRAPHING SPECIALTIES ***** 1974
49 226.0000 THELWALL_M(2013)8:0064841 Thelwall, M Do Altmetrics Work? Twitter and Ten Other Social Web Services PLOS ONE 2013
50 223.0000 MOED_H(1995)33:381 MOED, HF NEW BIBLIOMETRIC TOOLS FOR THE ASSESSMENT OF NATIONAL RESEARCH PERFORMANCE - DATABASE DESCRIPTION, OVERVIEW OF INDICATORS AND FIRST APPLICATIONS SCIENTOMETRICS 1995
==============================================================================
1. Input Degree Partition of N3 (484667)
==============================================================================
Dimension: 484667
The lowest value: 0
The highest value: 2085
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 12046 2.4854 12046 2.4854 SAID_H(2018)42:2507
1 393496 81.1889 405542 83.6744 ABDELAAL_A(2016)2:2015037
2 40958 8.4508 446500 92.1251 FLEISCHM_W(2016)68:153
3 13304 2.7450 459804 94.8701 CARLSON_J(2011)72:167
4 6473 1.3356 466277 96.2056 RATHI_V(2015)152:993
5 3904 0.8055 470181 97.0111 INTERNAT_C(2017):
6 2579 0.5321 472760 97.5433 MINISTRY_O(2017):
7 1845 0.3807 474605 97.9239 CAMPBELL_E(2007)356:1742
8 1415 0.2920 476020 98.2159 AUCKLAND_M(2012):
9 1145 0.2362 477165 98.4521 ZHOU_P(2014)99:695
10 855 0.1764 478020 98.6285 BEHRENS_H(2011)86:179
Boundary network
DC.clu
==============================================================================
2. C:\Mail.Ru Cloud\ANR HSE\ANR Projects\Scientometrics\Data\WoS2Pajek\DC.clu (484667)
==============================================================================
Dimension: 484667
The lowest value: 0
The highest value: 7
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 461371 95.1934 461371 95.1934 ABDELAAL_A(2016)2:2015037
1 10582 2.1834 471953 97.3768 SAID_H(2018)42:2507
2 2039 0.4207 473992 97.7975 HIRSCH_J(2005)102:16569
3 894 0.1845 474886 97.9819 GUMPENBE_C(2012)33:174
4 8069 1.6649 482955 99.6468 AYAZ_S(2016)109:1511
5 1186 0.2447 484141 99.8915 CHENG_T(2013)52:1630
6 283 0.0584 484424 99.9499 VANECK_N(2009)60:1635
7 243 0.0501 484667 100.0000 HOU_J(2018)115:869
----------------------------------------------------------------
Sum 484667 100.0000
Binarize partition
==============================================================================
3. Binarized C2 [1-*] (484667)
==============================================================================
Dimension: 484667
The lowest value: 0
The highest value: 1
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 461371 95.1934 461371 95.1934 ABDELAAL_A(2016)2:2015037
1 23296 4.8066 484667 100.0000 CHENG_T(2013)52:1630
----------------------------------------------------------------
Sum 484667 100.0000
DC_bin.clu
We want to take hits + works cited more than 3 times
Indegree partition:
Partition - Binarize Partition - [4-*]
==============================================================================
7. Binarized C1 [4-*] (484667)
==============================================================================
Dimension: 484667
The lowest value: 0
The highest value: 1
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 459804 94.8701 459804 94.8701 ABDELAAL_A(2016)2:2015037
1 24863 5.1299 484667 100.0000 CHENG_T(2013)52:1630
----------------------------------------------------------------
Sum 484667 100.0000
Partition DC Partition Indegree Partitions - Max
==============================================================================
8. Max of C3 and C7 (484667)
==============================================================================
Dimension: 484667
The lowest value: 0
The highest value: 1
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 442416 91.2825 442416 91.2825 ABDELAAL_A(2016)2:2015037
1 42251 8.7175 484667 100.0000 CHENG_T(2013)52:1630
----------------------------------------------------------------
Sum 484667 100.0000
Intersection of 23296 (DC.clu) and 24863 (Indegree partition) is 42251
Operations - Network + Partition
Extract Subnetwork
CiteB
==============================================================================
4. Cite_Bound (42251)
==============================================================================
Number of vertices (n): 42251
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Total number of lines 309844 0
----------------------------------------------------------
Number of loops 0 0
Number of multiple lines 0 0
----------------------------------------------------------
Density1 [loops allowed] = 0.00017357
Density2 [no loops allowed] = 0.00017357
Average Degree = 14.66682445
SPC weights on network
Strong components
==============================================================================
11. Strong Components of N4 [>=2] (42251, comp.=15)
==============================================================================
Dimension: 42251
The lowest value: 0
The highest value: 15
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 42218 99.9219 42218 99.9219 CHENG_T(2013)52:1630
1 3 0.0071 42221 99.9290 VELDEN_T(2017)111:1169
2 2 0.0047 42223 99.9337 ZHANG_Y(2016)105:179
3 2 0.0047 42225 99.9385 LANDSTRO_H(2012)41:1154
4 2 0.0047 42227 99.9432 PONCE_F(2010)112:223
5 2 0.0047 42229 99.9479 PACKER_A(2006)78:841
6 3 0.0071 42232 99.9550 HOLDEN_G(2005)41:1
7 2 0.0047 42234 99.9598 GARCIA-P_M(2009)81:779
8 2 0.0047 42236 99.9645 SCHUMMER_J(1997)39:125
9 2 0.0047 42238 99.9692 KONUR_O(2012)29:323
10 2 0.0047 42240 99.9740 SMITH_D(2009)61:194
11 2 0.0047 42242 99.9787 SCHAER_P(2013)38:282
12 3 0.0071 42245 99.9858 KONUR_O(2012)4:1603
13 2 0.0047 42247 99.9905 KONUR_O(2012)4:1935
14 2 0.0047 42249 99.9953 WORMELL_I(2000)48:237
15 2 0.0047 42251 100.0000 LI_X(2005)64:151
----------------------------------------------------------------
Sum 42251 100.0000
Cite_Strong comp (saved)
Preprint transformation
==============================================================================
6. Preprint Transformation of N4 (42284)
==============================================================================
Number of vertices (n): 42284
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Total number of lines 309878 0
----------------------------------------------------------
Number of loops 0 0
Number of multiple lines 0 0
----------------------------------------------------------
Density1 [loops allowed] = 0.00017332
Density2 [no loops allowed] = 0.00017332
Average Degree = 14.65698609
Search path count
==============================================================================
7. Citation weights SPC [flow] of N6 (42284)
==============================================================================
Number of vertices (n): 42284
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Number of lines with value=1 0 0
Number of lines with value#1 309878 0
----------------------------------------------------------
Total number of lines 309878 0
----------------------------------------------------------
Number of loops 0 0
Number of multiple lines 0 0
----------------------------------------------------------
Density1 [loops allowed] = 0.00017332
Density2 [no loops allowed] = 0.00017332
Average Degree = 14.65698609
Cite_SPC weights (saved)
Main path
Main Paths - Global Search - Standart
63 nodes
Cite_Main path + titles (saved)
Key Routes
Main Paths - Global Search - Key Routes [1-10] = 63 nodes (same)
Main Paths - Global Search - Key Routes [1-30] = 93 nodes
Cite_Key routes + titles (saved)
Islands
Islands - Generate networks with islands
Islands - Line weights [10, 200]
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 41735 98.7016 41735 98.7016 1
1 11 0.0260 41746 98.7277 37442
2 13 0.0307 41759 98.7584 42067
3 14 0.0331 41773 98.7915 41525
4 16 0.0378 41789 98.8293 41552
5 95 0.2247 41884 99.0540 20751
6 11 0.0260 41895 99.0800 35972
7 20 0.0473 41915 99.1273 21143
8 43 0.1017 41958 99.2290 37854
9 30 0.0709 41988 99.3000 31210
10 14 0.0331 42002 99.3331 35939
11 23 0.0544 42025 99.3875 36061
12 15 0.0355 42040 99.4229 3908
13 10 0.0236 42050 99.4466 35637
14 11 0.0260 42061 99.4726 1713
15 12 0.0284 42073 99.5010 4355
16 11 0.0260 42084 99.5270 1776
17 200 0.4730 42284 100.0000 5
----------------------------------------------------------------
Sum 42284 100.0000
Extracted all and looked at them
1, 2, 3 - Works citing 1 work (in each island - separate) of Bradford
4 - more interesting structure for 16 nodes - titles
5 - 95 nodes, can extract the titles
6 - also 2 works of Bradford, but also others
7 - works of Caraveo citing himself and others
8 - Zhang, Zhao, Song - Corean group (43 nodes)
9 - A lot of Mumford
10 - 14 nodes, can look at the titles
11 - 23 nodes, can look at the titles
12 - 15 nodes, not ineteresting, a lot of cited
13 - not interesting, a lot of Eliazar
14 - a lot of Konur, not interesting
15 - not interesting
16 - Svider, Eloy,... not ineteresting
17 - 200 nodes: too messy to draw, but we can extract the titles and compare with the MP and KR
Numbers for different sources in the Final table
j am soc inform sci 39
j informetr 33
scientometrics 29
j inform sci 10
j doc 15
annu rev inform sci 7
inform process manag 7
soc stud sci 6
libr trends 5
aslib j inform manag 4
book 4
czech j phys 3
online inform rev 3
prof inform 3
asist mon ser 2
biometrika 2
embo rep 2
front neurosci-switz 2
libr quart 2
libri 2
pro int conf sci inf 2
am sociol rev 1
anim sci pap rep 1
chinese sci bull 1
communication theory 1
curr contents 1
decis support syst 1
econ j 1
econometrica 1
eur j org chem 1
front hum neurosci 1
front mol neurosci 1
front pharmacol 1
harvard rev psychiat 1
j data info sci 1
libr inform sci res 1
manage sci 1
math comput model 1
mis quart 1
molecules 1
nature 1
oper res quart 1
p am soc inform sci 1
p natl acad sci usa 1
peerj comput sci 1
phil trans r soc b 1
plos one 1
res evaluat 1
science 1
soc indic res 1
soc sci inform 1
sum 212
%
j informetr 15,6
scientometrics 13,7
j am soc inform sci 18,4
j inform sci 4,7
j doc 7,1
annu rev inform sci 3,3
inform process manag 3,3
soc stud sci 2,8
libr trends 2,4
aslib j inform manag 1,9
book 1,9
czech j phys 1,4
online inform rev 1,4
prof inform 1,4
asist mon ser 0,9
biometrika 0,9
embo rep 0,9
front neurosci-switz 0,9
libr quart 0,9
libri 0,9
pro int conf sci inf 0,9
other 14,2 (30)