wj - Daria-Maltseva/mixedmethods GitHub Wiki
Journals analysis
WJ distributions
WJ
==============================================================================
Info on 2-Mode Network 1. C:\Mail.Ru Cloud\ANR HSE\ANR Projects\Mixed methods\Data\2019_FINAL DATASET_copy\WJ.net [2-Mode] (581529)
==============================================================================
Number of vertices (n): 581529
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Total number of lines 552218 0
----------------------------------------------------------
Number of loops 0 0
Number of multiple lines 4617 0
----------------------------------------------------------
2-Mode Network: Rows=547407, Cols=34122
Density [2-Mode] = 0.00002956
Average Degree = 1.89919333
Simplification (multiple line - single line)
==============================================================================
Info on 2-Mode Network 2. Simplification (Single Line) of N1 (581529)
==============================================================================
Number of vertices (n): 581529
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Total number of lines 547601 0
----------------------------------------------------------
Number of loops 0 0
Number of multiple lines 0 0
----------------------------------------------------------
2-Mode Network: Rows=547407, Cols=34122
Density [2-Mode] = 0.00002932
Average Degree = 1.88331450
Indegree
==============================================================================
1. Input Degree of N2 (581529)
==============================================================================
Dimension: 581529
The lowest value: 0.0000
The highest value: 184167.0000
Highest values:
Rank Vertex Value Id
--------------------------------------------------------
1 547408 184167.0000 *****
2 547415 2589.0000 SOC SCI MED
3 547426 1959.0000 J ADV NURS
4 547447 1836.0000 LANCET
5 547480 1640.0000 JAMA-J AM MED ASSOC
6 547591 1614.0000 AM J PUBLIC HEALTH
7 547575 1581.0000 BRIT MED J
8 547490 1387.0000 PEDIATRICS
9 547712 1215.0000 ACAD MED
10 547596 1100.0000 NEW ENGL J MED
11 548149 1085.0000 J CLIN NURS
12 548075 1056.0000 J GEN INTERN MED
13 547444 1022.0000 PATIENT EDUC COUNS
14 548233 961.0000 MED EDUC
15 547597 911.0000 AM J PREV MED
16 547606 909.0000 J PERS SOC PSYCHOL
17 547818 805.0000 J AM GERIATR SOC
18 547413 775.0000 QUAL HEALTH RES
19 548231 772.0000 MED TEACH
20 547908 768.0000 NURS EDUC TODAY
21 547633 724.0000 INT J NURS STUD
22 548452 710.0000 J ADOLESCENT HEALTH
23 547965 709.0000 TEACH TEACH EDUC
24 547431 709.0000 J CONSULT CLIN PSYCH
25 547411 679.0000 AIDS CARE
26 548245 672.0000 COMPUT EDUC
27 547593 662.0000 ANN INTERN MED
28 547848 660.0000 MED CARE
29 548066 638.0000 BRIT J PSYCHIAT
30 547823 637.0000 GERONTOLOGIST
31 548660 629.0000 PREV MED
32 547418 629.0000 AIDS BEHAV
33 548830 628.0000 ARCH PHYS MED REHAB
34 548805 620.0000 DISABIL REHABIL
35 548659 619.0000 HEALTH AFFAIR
36 548403 611.0000 J CLIN ONCOL
37 548210 603.0000 MED J AUSTRALIA
38 547769 602.0000 ACAD MANAGE J
39 547708 591.0000 J RES SCI TEACH
40 547787 585.0000 J APPL PSYCHOL
41 547414 580.0000 COMPUT HUM BEHAV
42 547677 579.0000 J BUS ETHICS
43 548734 572.0000 ARCH INTERN MED
44 548423 571.0000 J PAIN SYMPTOM MANAG
45 547978 568.0000 CHILD DEV
46 548179 564.0000 J EDUC PSYCHOL
47 548115 564.0000 CHILD YOUTH SERV REV
48 548187 558.0000 DIABETES CARE
49 547560 558.0000 AM PSYCHOL
50 548293 554.0000 ADDICTION
51 548285 537.0000 PSYCHIAT SERV
52 548514 528.0000 AM J PSYCHIAT
53 547636 521.0000 PALLIATIVE MED
54 547981 507.0000 WORLD DEV
55 547935 500.0000 B WORLD HEALTH ORGAN
56 547759 486.0000 ACAD MANAGE REV
57 548962 481.0000 QUAL LIFE RES
58 547992 476.0000 SCIENCE
59 548762 470.0000 J INTERPROF CARE
60 547561 466.0000 PSYCHOL BULL
--------------------------------------------------------
Sum (all values): 547601.0000
Place of J MIX METHOD RES - 547433 = 403 (75th place)
Plot
setwd("C:/Mail.Ru Cloud/ANR HSE/ANR Projects/Mixed methods/Analysis/Journals")
number<-read.table(file="WJn_Indegree.vec", sep=",", header=FALSE, skip=1)
t<-table(number)
head(t)
t
length (t)
names(t)
articles <- as.integer(names(t))
typeof(articles)
length(articles)
x <- articles[2:387]
y <- as.vector(t[2:387]
xt <- c(1,5,10,50,100,500, 1000, 2000, 2500)
plot (x,y,log="xy",pch=16, xlab="N of works", ylab="N of journals", main="N of works per journal")
axis(side=1,at=xt, labels=xt)
> head(t)
number
0 1 2 3 4 5
547407 18156 4353 2218 1346 875
Reduced network
WJ
Info: Rows=547407, Cols=34122
Make constant partition with the 2nd number (34122) (value = 1)
Open DC.clu - 1st partition
Constant partition - 2nd partition
Fuse partitions
==============================================================================
3. Fusion of C1 and C2 (581529)
==============================================================================
Dimension: 581529
The lowest value: 0
The highest value: 2
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 529238 91.0080 529238 91.0080 ABBOTT_L(2015)26:340
1 18169 3.1243 547407 94.1324 HILL_M(2018)34:71
2 34122 5.8676 581529 100.0000 *****
----------------------------------------------------------------
Sum 581529 100.0000
Extract subnetwork according to partition
==============================================================================
Extracting Subnetwork according to Partition
==============================================================================
Time spent: 0:00:00
==============================================================================
Info on 2-Mode Network 3. Extracting N2 according to C3 [1-*] (52291)
==============================================================================
Number of vertices (n): 52291
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Total number of lines 18363 0
----------------------------------------------------------
Number of loops 0 0
Number of multiple lines 0 0
----------------------------------------------------------
2-Mode Network: Rows=18169, Cols=34122
Density [2-Mode] = 0.00002962
Average Degree = 0.70233883
Network - 2-mode networks - Partition into 2 modes.
(We see that we have reduced N of works and all the journals)
==============================================================================
5. 2-Mode partition of N3 (52291)
==============================================================================
Dimension: 52291
The lowest value: 1
The highest value: 2
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
1 18169 34.7459 18169 34.7459 HILL_M(2018)34:71
2 34122 65.2541 52291 100.0000 *****
----------------------------------------------------------------
Sum 52291 100.0000
Binarise partition (1)
==============================================================================
6. Binarized C5 [1] (52291)
==============================================================================
Dimension: 52291
The lowest value: 0
The highest value: 1
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 34122 65.2541 34122 65.2541 *****
1 18169 34.7459 52291 100.0000 HILL_M(2018)34:71
----------------------------------------------------------------
Sum 52291 100.0000
Input degree partition
==============================================================================
7. Input Degree Partition of N3 (52291)
==============================================================================
Dimension: 52291
The lowest value: 0
The highest value: 481
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 48154 92.0885 48154 92.0885 HILL_M(2018)34:71
1 1841 3.5207 49995 95.6092 SOC NETWORKS
2 695 1.3291 50690 96.9383 BEHAV MED
Max of 2 partitions
=============================================================================
8. Max of C6 and C7 (52291)
==============================================================================
Dimension: 52291
The lowest value: 0
The highest value: 481
Frequency distribution of cluster values:
Cluster Freq Freq% CumFreq CumFreq% Representative
----------------------------------------------------------------
0 29985 57.3426 29985 57.3426 ISSUES MENT HEALTH NURS
1 20010 38.2666 49995 95.6092 HILL_M(2018)34:71
Extract subnetwork
==============================================================================
Info on 2-Mode Network 4. Extracting N3 according to C8 [1-*] (22306)
==============================================================================
Number of vertices (n): 22306
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Total number of lines 18363 0
----------------------------------------------------------
Number of loops 0 0
Number of multiple lines 0 0
----------------------------------------------------------
2-Mode Network: Rows=18169, Cols=4137
Density [2-Mode] = 0.00024430
Average Degree = 1.64646284
We have 18169 of works (hits) + 4137 journals
Saved as WJr
Indegree of WJr
==============================================================================
2. Input Degree of N4 (22306)
==============================================================================
Dimension: 22306
The lowest value: 0.0000
The highest value: 481.0000
Highest values:
Rank Vertex Value Id
--------------------------------------------------------
1 18170 481.0000 *****
2 20173 293.0000 BMJ OPEN
3 18193 291.0000 J MIX METHOD RES
4 18840 245.0000 BMC HEALTH SERV RES
5 18852 209.0000 BMC PUBLIC HEALTH
6 19251 208.0000 PLOS ONE
7 18793 166.0000 IMPLEMENT SCI
8 18187 121.0000 J ADV NURS
9 18576 97.0000 J CLIN NURS
10 18177 93.0000 SOC SCI MED
11 18458 87.0000 NURS EDUC TODAY
12 19610 84.0000 TRIALS
13 18904 83.0000 BMC MED EDUC
14 18400 76.0000 PROCD SOC BEHV
15 18555 72.0000 CHILD YOUTH SERV REV
16 18416 72.0000 GERONTOLOGIST
17 18178 71.0000 INT J QUAL METH
18 18325 60.0000 INT J NURS STUD
19 19526 59.0000 BMC PREGNANCY CHILDB
20 18913 57.0000 J INTERPROF CARE
21 18533 56.0000 J GEN INTERN MED
22 18175 56.0000 QUAL HEALTH RES
23 19069 55.0000 J MED INTERNET RES
24 19018 54.0000 QUAL LIFE RES
25 18584 53.0000 EVAL PROGRAM PLANN
26 18198 52.0000 MIDWIFERY
27 18986 49.0000 BMC FAM PRACT
28 18619 49.0000 MED TEACH
29 18934 48.0000 DISABIL REHABIL
30 18860 47.0000 QUAL QUANT
31 18201 47.0000 PATIENT EDUC COUNS
32 21751 46.0000 EDULEARN PROC
33 18245 46.0000 PSYCHO-ONCOLOGY
34 21376 45.0000 INTED PROC
35 18486 45.0000 TEACH TEACH EDUC
36 20091 43.0000 BMC MED INFORM DECIS
37 18465 43.0000 INT J ENV RES PUB HE
38 21754 42.0000 EUR CONF RES METH
39 20218 41.0000 JMIR RES PROTOC
40 18179 41.0000 AIDS BEHAV
41 19056 38.0000 TEACH COLL REC
42 18731 37.0000 J PAIN SYMPTOM MANAG
43 18207 37.0000 HEALTH EXPECT
44 19587 36.0000 HEALTH RES POLICY SY
45 18312 36.0000 J ADOLESCENT RES
46 18328 35.0000 PALLIATIVE MED
47 21860 34.0000 ICERI PROC
48 18456 34.0000 NURSE EDUC PRACT
49 18348 34.0000 QUAL REP
50 18826 33.0000 FRONT PSYCHOL
51 18719 33.0000 EUR J CANCER CARE
52 18660 33.0000 HEALTH SOC CARE COMM
53 18277 33.0000 BMC PSYCHIATRY
54 19513 32.0000 J PUBLIC HEALTH MAN
55 18745 32.0000 J ADOLESCENT HEALTH
56 18176 32.0000 COMPUT HUM BEHAV
57 19542 31.0000 INT J EQUITY HEALTH
58 18479 31.0000 EDUC RESEARCHER
59 18249 31.0000 SUPPORT CARE CANCER
60 18206 31.0000 BMJ QUAL SAF
--------------------------------------------------------
Sum (all values): 18363.0000
Plot in R
setwd("C:/Mail.Ru Cloud/ANR HSE/ANR Projects/Mixed methods/Analysis/Journals")
number<-read.table(file="WJr_Indegree.vec", sep=",", header=FALSE, skip=1)
t<-table(number)
head(t)
t
length (t)
names(t)
articles <- as.integer(names(t))
typeof(articles)
length(articles)
x <- articles[2:70]
y <- as.vector(t[2:70])
plot (x,y,log="xy",pch=16, xlab="N of works", ylab="N of journals", main="N of works per journal")
number
0 1 2 3 4 5
18169 1841 695 422 259 186
Citations between journals
Citation reduced
Simplified
==============================================================================
7. CiteR (547407)
==============================================================================
Number of vertices (n): 547407
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Total number of lines 769508 0
----------------------------------------------------------
Number of loops 0 0
Number of multiple lines 0 0
----------------------------------------------------------
Density1 [loops allowed] = 0.00000257
Density2 [no loops allowed] = 0.00000257
Average Degree = 2.81146569
CiteSimpl + DCbin - Extract 1 = CiteR.net
==============================================================================
8. Extracting N7 according to C1 [1] (18169)
==============================================================================
Number of vertices (n): 18169
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Total number of lines 25365 0
----------------------------------------------------------
Number of loops 0 0
Number of multiple lines 0 0
----------------------------------------------------------
Density1 [loops allowed] = 0.00007684
Density2 [no loops allowed] = 0.00007684
Average Degree = 2.79211844
Not normalized JCJ
Counting JCJ = t(WJr) * CiteR * WJr
WJr - Transpose 2-mode
CiteR - 1 mode to 2-mode
Select Transposed
Select CiteR 2-mode
Multiply
Select WJr - as 2nd
Multiply
JCJ.net
==============================================================================
12. JCJ (4137)
==============================================================================
Number of vertices (n): 4137
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Number of lines with value=1 8775 0
Number of lines with value#1 3288 0
----------------------------------------------------------
Total number of lines 12063 0
----------------------------------------------------------
Number of loops 381 0
Number of multiple lines 0 0
----------------------------------------------------------
Density [loops allowed] = 0.00070483
Average Degree = 5.83176215
Loops
==============================================================================
3. Loops from N12 (4137)
==============================================================================
Dimension: 4137
The lowest value: 0.0000
The highest value: 900.0000
Highest values:
Rank Vertex Value Id
--------------------------------------------------------
1 24 900.0000 J MIX METHOD RES
2 1 501.0000 *****
3 691 40.0000 QUAL QUANT
4 435 26.0000 ANN FAM MED
5 18 23.0000 J ADV NURS
6 6 22.0000 QUAL HEALTH RES
7 156 18.0000 INT J NURS STUD
8 486 16.0000 ADM POLICY MENT HLTH
9 113 15.0000 HEALTH SERV RES
10 849 14.0000 QUAL LIFE RES
11 289 14.0000 NURS EDUC TODAY
12 170 13.0000 QUAL INQ
13 142 13.0000 J COUNS PSYCHOL
14 1933 12.0000 J DEV EFFECT
15 744 12.0000 J INTERPROF CARE
16 3528 11.0000 SPE J
17 129 11.0000 AM J PREV MED
18 29 10.0000 MIDWIFERY
19 415 10.0000 EVAL PROGRAM PLANN
20 1814 9.0000 J SUSTAIN TOUR
21 657 9.0000 FRONT PSYCHOL
22 1082 9.0000 PLOS ONE
23 407 8.0000 J CLIN NURS
24 386 8.0000 CHILD YOUTH SERV REV
25 1344 8.0000 J PUBLIC HEALTH MAN
26 483 7.0000 PSYCHIAT SERV
27 900 7.0000 J MED INTERNET RES
28 856 7.0000 J SPORT EXERCISE PSY
29 1569 7.0000 LIBR INFORM SCI RES
30 1535 7.0000 QUAL RES ACCOUNT MAN
--------------------------------------------------------
Sum (all values): 2367.0000
Count proportion of self-citation
Loops
Vector - Weighted outdegree
1st vec, 2nd vec - divide
Then we need to order them
Order by loops:
select the loops vector
Vector/Make Permutation
Permutation/Mirror Permutation
Operations/Network+Permutation/Reorder Network
Operations/Vector+Permutation/Reorder Vector
File/Vector/Change Label JCJ loops perm
(save)
select the divided vector:
Operations/Vector+Permutation/Reorder Vector
File/Vector/Change Label JCJ_divide perm
(save)
Saved both files, open in Excel - Table JCJ vectors self citation
JCJ Reduction
!!! Remove ***** 1. - network JCJ 4136 nodes
Remove loops
Line values
Threshold 15 - Remove lines
All degree - Extract partition
JCJ line cut (42 nodes)
Fractional approach
JJf = t(WJ) * n(Cite) * WJ
CiteR norm1.mcr (write N of nodes 18169) CiteR_norm
the same procedure:
WJr - Transpose 2-mode
CiteR norm - 1 mode to 2-mode
Select Transposed
Select CiteR normalized 2-mode
Multiply
Select WJr - as 2nd
Multiply
JCJnorm.net
==============================================================================
38. JCJnorm (4137)
==============================================================================
Number of vertices (n): 4137
----------------------------------------------------------
Arcs Edges
----------------------------------------------------------
Number of lines with value=1 2426 0
Number of lines with value#1 9637 0
----------------------------------------------------------
Total number of lines 12063 0
----------------------------------------------------------
Number of loops 381 0
Number of multiple lines 0 0
----------------------------------------------------------
Density [loops allowed] = 0.00070483
Average Degree = 5.83176215
Remove ***** [1] Loops
Rank Vertex Value Id
--------------------------------------------------------
1 23 97.0435 J MIX METHOD RES
2 17 10.7619 J ADV NURS
3 434 8.3571 ANN FAM MED
4 485 8.3444 ADM POLICY MENT HLTH
5 1081 7.5000 PLOS ONE
6 288 7.3000 NURS EDUC TODAY
7 743 7.1167 J INTERPROF CARE
8 141 6.9167 J COUNS PSYCHOL
9 848 6.7492 QUAL LIFE RES
10 690 5.9711 QUAL QUANT
11 385 5.5000 CHILD YOUTH SERV REV
12 155 5.2833 INT J NURS STUD
13 406 4.6500 J CLIN NURS
14 5 4.4966 QUAL HEALTH RES
15 316 4.1667 TEACH TEACH EDUC
16 128 4.1667 AM J PREV MED
17 1343 4.1167 J PUBLIC HEALTH MAN
18 1568 4.0667 LIBR INFORM SCI RES
19 899 4.0333 J MED INTERNET RES
20 7 4.0000 SOC SCI MED
21 1932 3.9167 J DEV EFFECT
22 414 3.7556 EVAL PROGRAM PLANN
23 855 3.7500 J SPORT EXERCISE PSY
24 158 3.5110 PALLIATIVE MED
25 28 3.4524 MIDWIFERY
26 1287 3.3571 EMERG MED J
27 656 3.3278 FRONT PSYCHOL
28 2079 3.1667 OSTOMY WOUND MANAG
29 169 3.0525 QUAL INQ
30 1823 3.0000 FAM SOC
--------------------------------------------------------
Sum (all values): 535.1057
Vector - Weighted outdegree
1st vec, 2nd vec - divide
Then we need to order them
Order by loops:
select the loops vector
Vector/Make Permutation
Permutation/Mirror Permutation
Operations/Network+Permutation/Reorder Network
Operations/Vector+Permutation/Reorder Vector
File/Vector/Change Label JCJ loops perm
(save)
select the divided vector:
Operations/Vector+Permutation/Reorder Vector
File/Vector/Change Label JCJ_divide perm
(save)
Saved both files, open in Excel - Table JCJ vectors self citation
JCJnorm Reduction
Line cut - treshold 4
==============================================================================
41. Deleted loops in N39 (4136)
==============================================================================
Lowest value of line: 0.01785714
Highest value of line: 14.08333333
Line Values Frequency Freq% CumFreq CumFreq%
---------------------------------------------------------------------------------------------------
( ... 0.0179] 18 0.1918 18 0.1918
( 0.0179 ... 1.5807] 9151 97.4963 9169 97.6880
( 1.5807 ... 3.1435] 168 1.7899 9337 99.4779
( 3.1435 ... 4.7063] 26 0.2770 9363 99.7550
( 4.7063 ... 6.2692] 10 0.1065 9373 99.8615
( 6.2692 ... 7.8320] 8 0.0852 9381 99.9467
( 7.8320 ... 9.3948] 4 0.0426 9385 99.9893
( 9.3948 ... 10.9577] 0 0.0000 9385 99.9893
( 10.9577 ... 12.5205] 0 0.0000 9385 99.9893
( 12.5205 ... 14.0833] 1 0.0107 9386 100.0000
---------------------------------------------------------------------------------------------------
Total 9386 100.0000
JCJnorm_line cut(27)