Description

'Detect Duplicate Nodes' helps to find duplicate nodes in a network, by checking to see if any nodes in the network have labels that are very similar to each other.

It produces three results, one of which is a 'Merge Table' that can be used in conjunction with 'Update Network by Merging Nodes'.
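The idea of comparing labels and recording which nodes should be merged can be sketched in a few lines of Python. This is only an illustration: the page does not say which similarity measure or threshold the algorithm uses, so the sketch assumes `difflib.SequenceMatcher` from the standard library and a threshold of 0.9. The function names `find_similar_labels` and `build_merge_table` are hypothetical, and the returned dictionary is a stand-in for the 'Merge Table', not its actual format.

```python
from difflib import SequenceMatcher
from itertools import combinations


def find_similar_labels(labels, threshold=0.9):
    """Return (label_a, label_b, score) for every pair scoring at or above threshold.

    Assumption: similarity is difflib's ratio on lowercased labels. The real
    algorithm may use a different measure.
    """
    pairs = []
    for a, b in combinations(labels, 2):
        # ratio() is 1.0 for identical strings and approaches 0.0 for unrelated ones.
        score = SequenceMatcher(None, a.lower(), b.lower()).ratio()
        if score >= threshold:
            pairs.append((a, b, score))
    return pairs


def build_merge_table(labels, threshold=0.9):
    """Map each label to one representative label for its group of near-duplicates."""
    representative = {label: label for label in labels}

    def find(label):
        # Follow the chain of representatives until reaching a group's root.
        while representative[label] != label:
            label = representative[label]
        return label

    for a, b, _ in find_similar_labels(labels, threshold):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Merge b's group into a's group.
            representative[root_b] = root_a
    return {label: find(label) for label in labels}


if __name__ == "__main__":
    # Made-up node labels, for illustration only.
    nodes = ["Smith, J.", "Smith, J", "smith, j.", "Jones, A."]
    for label, target in build_merge_table(nodes).items():
        print(f"{label!r} -> {target!r}")
```

Because every pair of labels is compared, the sketch scales quadratically with the number of nodes, which is acceptable for small networks but would need a blocking or indexing step for large ones.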

Pros & Cons

Does a good, straightforward job of removing duplicate records. Its criteria for removing duplicates may not match your own. It will only remove "duplicate" nodes if they have the same UID (records with different UIDs could conceivably represent the same paper, depending on how clean the ISI data is). Only works on ISI data.

Applications

Used when you combine the results of two ISI queries into one. For instance, you can combine the results for "Grey Squirrel" with the results for "Red Squirrel" in a single file, and remove the duplicate publications using this algorithm.
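As a rough sketch of this use case, the snippet below combines two result sets and keeps only the first record seen for each UID, following the same-UID criterion described under Pros & Cons. The record layout (a list of dictionaries with a `uid` key), the function name `combine_without_duplicates`, and the sample records are all assumptions made for illustration. They are not the tool's internal representation or real ISI data.

```python
def combine_without_duplicates(*record_sets, uid_key="uid"):
    """Concatenate record sets, keeping the first record seen for each UID."""
    seen = set()
    combined = []
    for records in record_sets:
        for record in records:
            uid = record[uid_key]
            if uid not in seen:
                seen.add(uid)
                combined.append(record)
    return combined


if __name__ == "__main__":
    # Placeholder records standing in for the two query results.
    grey_squirrel = [{"uid": "A1", "title": "Paper A"},
                     {"uid": "B2", "title": "Paper B"}]
    red_squirrel = [{"uid": "B2", "title": "Paper B"},
                    {"uid": "C3", "title": "Paper C"}]
    for record in combine_without_duplicates(grey_squirrel, red_squirrel):
        print(record["uid"], record["title"])
```

Two records that describe the same paper under different UIDs would both survive this step, which is the limitation noted under Pros & Cons.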

Usage Hints

Will be indirectly employed if you choose "Load and Clean ISI data" from the File menu.
