Menu Path

Data Preparation > Database > ISI > Extract Documents By Year For Burst Detection


This algorithm extracts all of the document titles, abstract texts, full texts, and publication years included in the ISI dataset. It produces a table designed to be processed by the burst detection algorithm. This table consists of the following columns:

Usage Hints

Load an ISI file into the tool, then create a database from it using the ISI database loader.

Given your ISI database, run this extraction on it. Normalize the text of the results on the column you are detecting bursts on (DOCUMENT_TITLE, DOCUMENT_ABSTRACT, or DOCUMENT_TITLE_AND_ABSTRACT). Run burst detection on the results of the normalized text.

Implementation Details

The specific query run by the tool can be found in the source code.

See Also