Child pages
  • Extract Document Co-Citation Network

Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3

This documentation refers to the database version of 'Extract Document Co-Citation'. There is also a non-database version of the same algorithm that I don't believe is currently documented anywhere. The concept is the same but the details are different.


Extracts the document co-citation network from an ISI database.

Each document and each referenced document in the input database is represented by a node. An edge is drawn between the nodes for two documents if and only if they were cited by a common document in your dataset.

Core Document vs. Non-Core Document

There is a distinction drawn between documents contained in your dataset and documents in general. A document in your dataset is called a "core document". Your documents may (and probably do) reference non-core documents.

The output network of this algorithm will contain nodes representing even non-core documents. 


The output network will include the following summaries of your dataset:

  • Node (Document)
    • A prettified label for identifying this document.
    • A prettified string giving the journal (or other source) of this document (called 'source').
  • Edge (Co-Citation)
    • The number of documents in your dataset or referenced in your dataset which cited both of these documents.
      • This figure is also given with three common normalizations: Jaccard, cosine, and Dice.
    • Publication year of the earliest co-citing document.
    • Publication year of the most recent co-citing document.
Usage Hints

Load a dataset into the tool, then extract the paper-citation network and execute this algorithm.

See Also

Incoming Links