Program project collaboration through publication and citation
Source: R/collaboration.R
collaboration.Rd
copublication_data() returns a tibble enumerating all pmid acknowledging more than one project.
copublication() summarizes collaboration between projects at the time of publication.
cocitation_data() returns a tibble enumerating all pmid cited by other program pmid.
cocitation() summarizes collaboration between projects through citation.
Usage
copublication_data(tbl, exclude = NULL)
copublication(tbl)
cocitation_data(tbl)
cocitation(tbl)
Arguments
- tbl: a tibble with column opportunity_number containing the Funding Opportunity Announcements defining the projects of interest.
- exclude: character() pmid to exclude from summary.
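As a minimal sketch of the two inputs described above, the following builds a `tbl` with the required `opportunity_number` column and an `exclude` vector of the documented type. The two pmid values are hypothetical placeholders chosen for illustration, and the checks are only an assumption about what a caller might want to verify before calling the functions; they are not part of the package.

```r
library(tibble)

# Funding Opportunity Announcement numbers taken from the Examples section.
foas <- tribble(
    ~opportunity_number,
    "RFA-CA-19-039",
    "RFA-CA-19-038"
)

# Hypothetical pmids, used only to show the documented type of `exclude`
# (a character vector rather than integers).
exclude <- as.character(c(10000001L, 10000002L))

# Sanity checks on the shape of the inputs (an illustration, not package code).
stopifnot(
    "opportunity_number" %in% names(foas),
    is.character(foas$opportunity_number),
    is.character(exclude)
)
```

Following the Usage section, these objects would then be passed as copublication_data(foas, exclude = exclude).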
Value
copublication_data() returns a tibble with the following columns.
- core_project_num.x, core_project_num.y: character() pairwise project collaboration.
- n: integer() number of collaborations between projects.
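To make the pairwise columns concrete, here is a minimal sketch of how project pairs could be counted from a table of pmid-to-project acknowledgements. The `pubs` table is hypothetical toy data, and the approach (a self-join on pmid, keeping each unordered pair once) is an assumption for illustration; the package may enumerate pairs differently, for instance in both orders.

```r
library(dplyr)

# Hypothetical pmid-to-project acknowledgements; not real program data.
pubs <- tribble(
    ~pmid, ~core_project_num,
    1L,    "A",
    1L,    "B",
    2L,    "A",
    3L,    "A",
    3L,    "B",
    3L,    "C"
)

# Self-join on pmid pairs up every project acknowledged by the same pmid;
# base merge() adds the default .x / .y suffixes to the duplicated column.
pairs <- merge(pubs, pubs, by = "pmid") |>
    as_tibble() |>
    # Keep each unordered pair once and drop a project paired with itself.
    filter(core_project_num.x < core_project_num.y) |>
    count(core_project_num.x, core_project_num.y, name = "n")

pairs
```

In this toy data pmid 2 acknowledges a single project, so it contributes no pair, while pmid 3 acknowledges three projects and contributes three pairs.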
copublication() returns a tibble with columns
- core_project_num: character() project numbers involved in collaboration.
- n: integer() total number of publications from the project.
- citn: integer() total number of citations to project publications.
- rcr: numeric() total relative citation ratio of project publications.
- collab: integer() number of distinct collaborators, across all collaborative publications.
- n_collab: integer() number of collaborative publications.
- citn_collab: numeric() total 'citation_count' for collaborative publications.
- rcr_collab: numeric() total 'relative citation ratio' for collaborative publications.
cocitation_data() returns a tibble with the following columns.
- pmid: integer() pmid of original publication.
- cited_by: integer() pmid of publication citing original publication.
- core_project_num: character() project number of project acknowledged in original publication.
- cited_by_core_project_num: character() project number of project citing the original publication.
cocitation() returns a tibble with columns
- core_project_num: character() project number associated with publications.
- n: integer() total number of publications from the project.
- citn: integer() total number of citations to project publications.
- rcr: numeric() total relative citation ratio of project publications.
- n_self_citn: integer() number of citations to other publications of the same project.
- n_collab_citn: integer() number of publications cited by other projects.
- n_collab: integer() total number of projects citing this project.
- rcr_citn: numeric() total 'relative citation ratio' of collaborative cited publications.
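The citation columns above can be illustrated with a minimal sketch that derives `n_self_citn`, `n_collab_citn`, and `n_collab` from rows shaped like the output of cocitation_data(). The `cites` table is hypothetical toy data, and the counting rules are assumptions: self-citations are rows where both project numbers match, and `n_collab_citn` is taken as the number of rows cited by a different project. The package may count distinct pmid instead.

```r
library(dplyr)

# Hypothetical rows shaped like cocitation_data() output; not real pmids.
cites <- tribble(
    ~pmid, ~cited_by, ~core_project_num, ~cited_by_core_project_num,
    1L,    2L,        "A",               "A",
    1L,    3L,        "A",               "B",
    1L,    4L,        "A",               "C",
    5L,    3L,        "A",               "B",
    6L,    7L,        "B",               "B"
)

# Citations where the citing and cited publication share a project.
self <- cites |>
    filter(core_project_num == cited_by_core_project_num) |>
    count(core_project_num, name = "n_self_citn")

# Citations coming from a different project, and how many such projects.
collab <- cites |>
    filter(core_project_num != cited_by_core_project_num) |>
    group_by(core_project_num) |>
    summarize(
        n_collab_citn = n(),
        n_collab = n_distinct(cited_by_core_project_num)
    )

# Projects missing from either summary get NA, as in the Examples output.
cocite_summary <- full_join(self, collab, by = "core_project_num") |>
    arrange(core_project_num)

cocite_summary
```

Project "B" is cited only by itself in this toy data, so its collaboration columns are NA rather than 0.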
Examples
library(dplyr)  # provides tribble(), distinct(), and count() used below
foas <- tribble(
    ~opportunity_number, ~description,
    "RFA-CA-19-039", "Early-Stage Development of ...",
    "RFA-CA-19-038", "Development of Innovative ...",
    "RFA-CA-19-040", "Advanced Development of ...",
    "RFA-CA-19-041", "Sustained Support of ..."
)
copublication_data(foas)
#> # A tibble: 0 × 3
#> # ℹ 3 variables: core_project_num.x <chr>, core_project_num.y <chr>, n <int>
copublication(foas)
#> # A tibble: 16 × 8
#> core_project_num n citn rcr collab n_collab citn_collab rcr_collab
#> <chr> <int> <dbl> <dbl> <int> <int> <dbl> <dbl>
#> 1 R21CA248118 6 76 11.7 NA NA NA NA
#> 2 R21CA248122 9 98 10.8 NA NA NA NA
#> 3 R21CA253408 12 51 7.57 NA NA NA NA
#> 4 R21CA253498 16 144 36.4 NA NA NA NA
#> 5 U01CA247760 10 69 8.45 NA NA NA NA
#> 6 U01CA248226 74 1178 178. NA NA NA NA
#> 7 U01CA248235 12 349 49.1 NA NA NA NA
#> 8 U01CA253403 24 298 30.6 NA NA NA NA
#> 9 U01CA253511 5 5 0.83 NA NA NA NA
#> 10 U24CA248010 5 61 4.65 NA NA NA NA
#> 11 U24CA248265 18 455 48.7 NA NA NA NA
#> 12 U24CA248453 14 89 22.4 NA NA NA NA
#> 13 U24CA248454 15 189 41.4 NA NA NA NA
#> 14 U24CA248457 35 933 58.4 NA NA NA NA
#> 15 U24CA253531 6 57 9.99 NA NA NA NA
#> 16 U24CA253539 1 3 0 NA NA NA NA
cocite_data <- cocitation_data(foas)
cocite_data
#> # A tibble: 170 × 4
#> pmid cited_by core_project_num cited_by_core_project_num
#> <int> <dbl> <chr> <chr>
#> 1 29985391 31913126 U24CA248457 U24CA248457
#> 2 29985391 30773340 U24CA248457 U24CA248457
#> 3 30563849 35977718 U24CA248457 U24CA248457
#> 4 30723579 32530570 U24CA248457 U24CA248457
#> 5 30723579 34641956 U24CA248457 U24CA248457
#> 6 30773340 31913126 U24CA248457 U24CA248457
#> 7 30773340 31171722 U24CA248457 U24CA248457
#> 8 30773340 34376580 U24CA248457 U24CA248457
#> 9 31171722 37308665 U24CA248457 U24CA248457
#> 10 31395609 36990288 U24CA248010 U24CA248010
#> # ℹ 160 more rows
## pmid / core_project_num (and cited_by / cited_by_core_project_num)
## pairs are listed separately because a single pmid may acknowledge
## several projects
cocite_data |>
    distinct(pmid, core_project_num) |>
    count(pmid, sort = TRUE)
#> # A tibble: 93 × 2
#> pmid n
#> <int> <int>
#> 1 29985391 1
#> 2 30563849 1
#> 3 30723579 1
#> 4 30773340 1
#> 5 31171722 1
#> 6 31395609 1
#> 7 31913126 1
#> 8 31932491 1
#> 9 31974273 1
#> 10 32119649 1
#> # ℹ 83 more rows
## cocitation 'edges' between projects, and their weights
cocite_data |>
    count(core_project_num, cited_by_core_project_num, sort = TRUE)
#> # A tibble: 20 × 3
#> core_project_num cited_by_core_project_num n
#> <chr> <chr> <int>
#> 1 U01CA248226 U01CA248226 67
#> 2 U01CA253403 U01CA253403 33
#> 3 U24CA248457 U24CA248457 15
#> 4 R21CA248122 R21CA248122 11
#> 5 R21CA248118 R21CA248118 9
#> 6 U24CA248454 U24CA248454 5
#> 7 U24CA253531 U24CA253531 4
#> 8 R21CA253408 R21CA253408 3
#> 9 R21CA253498 R21CA253498 3
#> 10 U01CA248235 U01CA248235 3
#> 11 U24CA248265 U24CA248453 3
#> 12 U24CA248453 U24CA248453 3
#> 13 U01CA247760 U24CA248453 2
#> 14 U24CA248010 U24CA248010 2
#> 15 U24CA248265 U24CA248265 2
#> 16 U01CA248226 R21CA253498 1
#> 17 U24CA248010 U24CA248265 1
#> 18 U24CA248265 U24CA248457 1
#> 19 U24CA248454 U24CA248265 1
#> 20 U24CA248457 U01CA247760 1
cocitation(foas)
#> # A tibble: 16 × 8
#> core_project_num n citn rcr n_self_citn n_collab_citn n_collab
#> <chr> <int> <dbl> <dbl> <int> <int> <int>
#> 1 R21CA248118 6 76 11.7 9 NA NA
#> 2 R21CA248122 9 98 10.8 11 NA NA
#> 3 R21CA253408 12 51 7.57 3 NA NA
#> 4 R21CA253498 16 144 36.4 3 NA NA
#> 5 U01CA247760 10 69 8.45 NA 2 1
#> 6 U01CA248226 74 1178 178. 67 1 1
#> 7 U01CA248235 12 349 49.1 3 NA NA
#> 8 U01CA253403 24 298 30.6 33 NA NA
#> 9 U01CA253511 5 5 0.83 NA NA NA
#> 10 U24CA248010 5 61 4.65 2 1 1
#> 11 U24CA248265 18 455 48.7 2 4 2
#> 12 U24CA248453 14 89 22.4 3 NA NA
#> 13 U24CA248454 15 189 41.4 5 1 1
#> 14 U24CA248457 35 933 58.4 15 1 1
#> 15 U24CA253531 6 57 9.99 4 NA NA
#> 16 U24CA253539 1 3 0 NA NA NA
#> # ℹ 1 more variable: rcr_citn <dbl>