Skip to contents

copublication_data() returns a tibble enumerating all pmid acknowledging more than one project.

copublication() summarizes collaboration between projects at the time of publication.

cocitation_data() returns a tibble enumerating all pmid cited by other program pmid.

cocitation() summarizes collaborations between projects through citation.

Usage

copublication_data(tbl, exclude = NULL)

copublication(tbl)

cocitation_data(tbl)

cocitation(tbl)

Arguments

tbl

a tibble with column opportunity_number containing the Funding Opportunity Announcements defining the projects of interest.

exclude

character() pmid to exclude from summary.

Value

copublication_data() returns a tibble with the following columns.

  • core_project_num.x, core_project_num.y: character() pairwise project collaboration.

  • n: intger() number of collaborations between projects.

copublication() returns a tibble with columns

  • core_project_num: character() project numbers involved in collaboration

  • n: integer() total number of publications from the project.

  • citn: integer() total number of citations to project publications.

  • rcr: numeric() total relative citation rate of project publications.

  • collab: integer() number of distinct collaborators, across all collaborative publications.

  • n_collab: integer() number of collaborative publications.

  • citn_collab: numeric() total 'citation_count' for collaborative publications.

  • rcr_collab: numeric() total 'relative citation ratio' for collaborative publications.

cocitation_data() returns a tibble with the following columns.

  • pmid: integer() pmid of original publication.

  • cited_by: integer() pmid of publication citing original publication.

  • core_project_num: character() project number of project acknowledged in original publication.

  • cited_by_core_project_num: character() project number of project citing the original publication.

cocitation() returns a tibble with columns

  • project: character() project number associated with publications.

  • n: integer() total number of publications from the project.

  • citn: integer() total number of citations to project publications.

  • rcr: numeric() total relative citation rate of project publications.

  • n_self_citn: integer() number of citations to other publications of the same project.

  • n_collab_citn: integer() number of publications cited by other projects.

  • n_collab: integer() total number of projects citing this project.

  • rcr_citn: numeric() total 'relative citation ratio' of collaborative cited publications.

Examples

foas <- tribble(
    ~opportunity_number,       ~description,
    "RFA-CA-19-039", "Early-Stage Development of ...",
    "RFA-CA-19-038", "Development of Innovative ...",
    "RFA-CA-19-040", "Advanced Development of ...",
    "RFA-CA-19-041", "Sustained Support of ..."
)

copublication_data(foas)
#> # A tibble: 0 × 3
#> # ℹ 3 variables: core_project_num.x <chr>, core_project_num.y <chr>, n <int>

copublication(foas)
#> # A tibble: 16 × 8
#>    core_project_num     n  citn    rcr collab n_collab citn_collab rcr_collab
#>    <chr>            <int> <dbl>  <dbl>  <int>    <int>       <dbl>      <dbl>
#>  1 R21CA248118          6    76  11.7      NA       NA          NA         NA
#>  2 R21CA248122          9    98  10.8      NA       NA          NA         NA
#>  3 R21CA253408         12    51   7.57     NA       NA          NA         NA
#>  4 R21CA253498         16   144  36.4      NA       NA          NA         NA
#>  5 U01CA247760         10    69   8.45     NA       NA          NA         NA
#>  6 U01CA248226         74  1178 178.       NA       NA          NA         NA
#>  7 U01CA248235         12   349  49.1      NA       NA          NA         NA
#>  8 U01CA253403         24   298  30.6      NA       NA          NA         NA
#>  9 U01CA253511          5     5   0.83     NA       NA          NA         NA
#> 10 U24CA248010          5    61   4.65     NA       NA          NA         NA
#> 11 U24CA248265         18   455  48.7      NA       NA          NA         NA
#> 12 U24CA248453         14    89  22.4      NA       NA          NA         NA
#> 13 U24CA248454         15   189  41.4      NA       NA          NA         NA
#> 14 U24CA248457         35   933  58.4      NA       NA          NA         NA
#> 15 U24CA253531          6    57   9.99     NA       NA          NA         NA
#> 16 U24CA253539          1     3   0        NA       NA          NA         NA

cocite_data <- cocitation_data(foas)
cocite_data
#> # A tibble: 170 × 4
#>        pmid cited_by core_project_num cited_by_core_project_num
#>       <int>    <dbl> <chr>            <chr>                    
#>  1 29985391 31913126 U24CA248457      U24CA248457              
#>  2 29985391 30773340 U24CA248457      U24CA248457              
#>  3 30563849 35977718 U24CA248457      U24CA248457              
#>  4 30723579 32530570 U24CA248457      U24CA248457              
#>  5 30723579 34641956 U24CA248457      U24CA248457              
#>  6 30773340 31913126 U24CA248457      U24CA248457              
#>  7 30773340 31171722 U24CA248457      U24CA248457              
#>  8 30773340 34376580 U24CA248457      U24CA248457              
#>  9 31171722 37308665 U24CA248457      U24CA248457              
#> 10 31395609 36990288 U24CA248010      U24CA248010              
#> # ℹ 160 more rows

## pmid / core_project_num (and cited_by / cited_by_core_project_num)
## reflects the fact that a single pmid may acknowledge several
## projects
cocite_data |>
    distinct(pmid, core_project_num) |>
    count(pmid, sort = TRUE)
#> # A tibble: 93 × 2
#>        pmid     n
#>       <int> <int>
#>  1 29985391     1
#>  2 30563849     1
#>  3 30723579     1
#>  4 30773340     1
#>  5 31171722     1
#>  6 31395609     1
#>  7 31913126     1
#>  8 31932491     1
#>  9 31974273     1
#> 10 32119649     1
#> # ℹ 83 more rows

## cocitation 'edges' between projects, and their weights
cocite_data |>
    count(core_project_num, cited_by_core_project_num, sort = TRUE)
#> # A tibble: 20 × 3
#>    core_project_num cited_by_core_project_num     n
#>    <chr>            <chr>                     <int>
#>  1 U01CA248226      U01CA248226                  67
#>  2 U01CA253403      U01CA253403                  33
#>  3 U24CA248457      U24CA248457                  15
#>  4 R21CA248122      R21CA248122                  11
#>  5 R21CA248118      R21CA248118                   9
#>  6 U24CA248454      U24CA248454                   5
#>  7 U24CA253531      U24CA253531                   4
#>  8 R21CA253408      R21CA253408                   3
#>  9 R21CA253498      R21CA253498                   3
#> 10 U01CA248235      U01CA248235                   3
#> 11 U24CA248265      U24CA248453                   3
#> 12 U24CA248453      U24CA248453                   3
#> 13 U01CA247760      U24CA248453                   2
#> 14 U24CA248010      U24CA248010                   2
#> 15 U24CA248265      U24CA248265                   2
#> 16 U01CA248226      R21CA253498                   1
#> 17 U24CA248010      U24CA248265                   1
#> 18 U24CA248265      U24CA248457                   1
#> 19 U24CA248454      U24CA248265                   1
#> 20 U24CA248457      U01CA247760                   1

cocitation(foas)
#> # A tibble: 16 × 8
#>    core_project_num     n  citn    rcr n_self_citn n_collab_citn n_collab
#>    <chr>            <int> <dbl>  <dbl>       <int>         <int>    <int>
#>  1 R21CA248118          6    76  11.7            9            NA       NA
#>  2 R21CA248122          9    98  10.8           11            NA       NA
#>  3 R21CA253408         12    51   7.57           3            NA       NA
#>  4 R21CA253498         16   144  36.4            3            NA       NA
#>  5 U01CA247760         10    69   8.45          NA             2        1
#>  6 U01CA248226         74  1178 178.            67             1        1
#>  7 U01CA248235         12   349  49.1            3            NA       NA
#>  8 U01CA253403         24   298  30.6           33            NA       NA
#>  9 U01CA253511          5     5   0.83          NA            NA       NA
#> 10 U24CA248010          5    61   4.65           2             1        1
#> 11 U24CA248265         18   455  48.7            2             4        2
#> 12 U24CA248453         14    89  22.4            3            NA       NA
#> 13 U24CA248454         15   189  41.4            5             1        1
#> 14 U24CA248457         35   933  58.4           15             1        1
#> 15 U24CA253531          6    57   9.99           4            NA       NA
#> 16 U24CA253539          1     3   0             NA            NA       NA
#> # ℹ 1 more variable: rcr_citn <dbl>