Child pages
  • Load NSF File Into Database

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3

...

Note: NSF format has pre-defined set of fields which are found in all the NSF files. They are as follows:

Code Block

Title, Principal Investigator, Program(s),  Organization Phone, State, Organization Zip, Program Element Code(s),  NSF Directorate, NSF Organization, Field Of Application(s), Organization  Street Address,
Program Manager, Organization State, Expiration Date,  Program Reference Code(s), PI Email Address, Organization, Award  Instrument, Awarded Amount to Date, Last Amendment Date, Co-PI Name(s),  Award Number,
Organization City, Start Date, Abstract.

The database schema we use in this NSF loader and our other NSF database-related algorithms takes into account only these fields. Although any "arbitrary" fields found will merely be appended to the AWARD table and will not be reflected in the schema structure on a high level. Also many times due to CSV corruption we merge all the columns right to the abstract column that leads to "Abstract" being broken into multiple columns.

...

The following is a list of NSF database tables that link to pages describing their fields and how they are parsed out of NSF datasets: