How to extract and download data from the Local Systems database.
Data may be downloaded as XML, in the same format used to harvest the local systems (see the MXD section below).
Data may be downloaded as JSON, in the same format we use to build the portal (see the JSON Data section below).
MXD
DDF-MXD is the Danish Research Database Metadata Exchange Format for Documents.
It is the only format supported for adding data to the NORA Local Systems portal.
The JSON data is an internal representation of the MXD data.
It is a subset of the MXD data fields: only those that are useful to and supported by the portal.
It does, however, contain enhancements, primarily the normalisation of some of the data, such as Danish organisation names, locations, OAI Scope, etc.
JSON fields description:
[
{
"abstract" : "", // - Abstract text, in English if available
"author_count" : "", // - Authors count, based on authors in that record (since 2024-02)
"bfi_cno" : "", // - BFI conference number, old records only
"bfi_pno" : "", // - BFI publisher number, old records only
"bfi_sno" : "", // - BFI serial number, old records only
"bfi_subject" : "", // - BFI expert group, old records only
"cluster_size" : "", // - Number of data providers in cluster, cuid, 1, 2, 3, 4, 5+ (since 2024-02)
"cuid" : "", // - Cluster ID (since 2023-10 experimental, 2024-02 production)
"dimID" : [""], // - List of Dimensions IDs for this record
"event" : {
"acronym" : "", // - Event acronym
"end" : "", // - Event end date
"place" : "", // - Event location
"start" : "", // - Event start date
"title" : "", // - Event full name
"year" : "" // - Event year
},
"filters" : { // - Filters are generally for internal indexing use but some of the detailed
// information can be generally useful
"added" : "", // - When the record was added to the research portal (YYYY-MM, Since 2023-03)
"altsrc" : [""], // - List of global sources in which this record can be found
"author_count" : "", // - A filter based on clustering of author_count [1, 2-5, 6-10, 11-50, 51-200, 201-1000,
// 1001-]. Mostly useful for Search filtering.
"continent" : [""], // - List of geographic regions of organisations collaborating on this publication
"corpau" : [""], // - List of corporate authors
"country" : [""], // - List of countries of organisations collaborating on this publication
"glorg" : [""], // - List of global organisations collaborating on this publication. Also includes
// "All Global Affiliations" for search purposes.
"keyword" : [""], // - A cleaned up list of keywords for filtering
"level" : "", // - MXD doc_level
"mra" : "", // - MXD research area (code)
"oa" : "", // - The type of Open Access to this record
"oaiscope" : "", // - Boolean Yes or No, defines if this record is part of OA-Indicator.
"org" : {
"level1" : [""], // - Level 1 Danish orgs for filtering only, always di
"level2" : [""], // - Level 2 Danish orgs for filtering only, see codes below in level 3 (they have a DI- prefix
// in level2)
"level3" : { // - Lists of org codes, split by org types level2.
"ART" : [""], // Artistic Higher Education Institutions (since 2023-01)
"BA" : [""], // Business Academies (since 2023-01)
"GTS" : [""], // GTS Institutes (since 2023-01)
"GV" : [""], // Governmental Institutions (since 2023-01)
"HO" : [""], // Hospitals
"NP" : [""], // Non-Profit Organisations (since 2023-01)
"OT" : [""], // Other Research
"PR" : [""], // Private Research
"SE" : [""], // Specialized Research
"U" : [""], // Universities
"UC" : [""] // University Colleges
},
"level3all" : [""] // - All level3 orgs for new national collaboration filter (since 2024-02)
},
"review" : "", // - Review codes, direct from MXD
"rtype" : [""], // - Record type, sin: single, clu: cluster, or both for records which are the same
// for both (since 2023-06 experimental, 2024-02 production)
"sorti" : "", // - Sort title (since 2024-02)
"src" : "", // - Data source, direct from MXD
"src-code" : "", // - NORA code for the source "src" (since 2023-07)
"type" : "", // - Record type, direct from MXD
"type-grp" : [""] // - Type group, based on type. So far the only active group is pub. You may also
// find dat (data), pat (patent) and all.
},
"ft" : { // - The best fulltext for that record (since 2024-02)
"access" : "", // - access type: oa or un for Open Access and Undefined
"license" : "", // - mostly free text
"text" : "", // - free text description
"type" : "", // - doi (DOI based), loc (Local fulltext), rem (Remote fulltext), url (Unknown)
"url" : "", // - The fulltext URL
"version" : "" // - Pre-print, Authors Final Version, Published Version, Other Version
},
"id" : "", // - Original UUID
"identifier" : [
{
"id" : "", // - Some form of ID
"type" : "" // - Types, so far: scopus, pmid, wos
}
],
"inetpub" : [
{
"access" : "", // - Direct from MXD
"text" : "",
"type" : "", // - type from mxd (since 2024-02)
"uri" : "",
"url" : "", // - For URLs extracted from publication part of MXD
"version" : "" // - version from mxd (since 2024-02)
}
],
"keyword" : [""], // - Direct from MXD
"lang" : "", // - English text version of Language
"level" : "", // - English text version of MXD level
"mra" : "", // - English text version of MXD MRA
"noi" : "", // - NORA Object Identifier, for clustering between Local and Global data (since 2024-02)
"oa_link" : [
{
"access" : "", // - Direct from MXD
"embargo" : "",
"license" : "",
"type" : "",
"url" : "",
"version" : ""
}
],
"oai" : { // - Added OA-Indicator
"class" : "", // - OA-Indicator class (Blocked, Realised, Unused)
"mra" : "", // - Main Research Area
"yearPub" : "", // - Publication year
"yearSub" : "" // - Submission year
},
"org" : [
{
"affno" : "", // - Affiliation number for link with person below
"country" : "", // - Country code, 2 letters
"l2" : "", // - NORA level 2 code (org type)
"l2n" : "", // - Corresponding text in English
"l3" : "", // - NORA level 3 code (org code)
"l3nl" : "", // - Long English text version of l3
"l3nn" : "", // - More compact version of the text
"loc" : [""], // - Local country / regions
"name" : "", // - Full org name
"nora_code" : "", // - Same as l3
"role" : "" // - Can be used for "Corporate author"
}
],
"person" : [
{
"affno" : "", // - Affiliation number for link with org above
"country" : "", // - Country code, 2 letters
"locid" : "", // - Local ID from data provider (src)
"name" : "", // - Full name
"first" : "", // - First name
"last" : "", // - Last name
"orcid" : "", // - ORCID
"role" : "", // - So far: Editor, Editor of Host Publication, Illustrator, Inventor, Other,
// Publisher, Supervisor, Translator
"rolecode" : "" // - The original code from MXD (since 2023-03)
}
],
"pmid" : "", // - PubMed ID
"pub" : {
"doi" : "", // - DOI, mostly at document level but sometimes at a higher publication level, such as
// the book for a book chapter.
"edition" : "", // - Direct from MXD
"eisbn" : [""], // - Electronic ISBN
"eissn" : [""], // - Electronic ISSN
"isbn" : [""], // - ISBN
"issn" : [""], // - ISSN
"issue" : "", // - Issue number, direct from MXD
"journal" : "", // - Journal title
"newspaper" : "", // - Newspaper title
"other" : "", // - Other publication (Direct from MXD)
"pages" : "", // - Number of pages or page range
"place" : "", // - Publication place
"publisher" : "", // - Publisher name
"repno" : "", // - Report number
"series" : "", // - Series title
"status" : "", // - Publication status code (direct from MXD)
"subtitle" : "", // - Sub-title
"title" : "", // - Publication title
"volume" : "" // - Volume
},
"review" : "", // - Review text
"scoID" : [""], // - ID of corresponding Scopus record
"sdg" : [""], // - List of SDG classifications
"search" : {
"DAF" : [""], // - Danish university code and text, for search purpose
"PST" : "" // - Publication status text, for search purpose
},
"sid" : "", // - A combination of src and id, used internally only so far
"src" : "", // - Data provider as abbreviated English text
"title" : "", // - Record title
"type" : "", // - Record type as English text
"url" : "", // - URL back to the record in research DB UI (since 2024-02)
"wosID" : [""], // - ID of corresponding Clarivate record
"year" : "", // - Publication year
"yearsub" : "" // - Submission year
}
]
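The affno field is described above as the link between entries in person and entries in org. A minimal Python sketch of that lookup follows. It assumes each affno is a single string that matches exactly between the two lists, which the description suggests but does not guarantee. The function name and the sample record are hypothetical and only serve as illustration.

```python
from collections import defaultdict


def affiliations_by_person(record):
    """Map each person's name to the names of orgs sharing their affno.

    Assumption: "affno" is one plain string per person/org entry and an
    empty string means "no affiliation given".
    """
    orgs_by_affno = defaultdict(list)
    for org in record.get("org", []):
        orgs_by_affno[org.get("affno", "")].append(org.get("name", ""))

    result = {}
    for person in record.get("person", []):
        affno = person.get("affno", "")
        names = orgs_by_affno.get(affno, []) if affno else []
        # A person listed more than once accumulates all matching orgs.
        result.setdefault(person.get("name", ""), []).extend(names)
    return result


# Hypothetical record, reduced to the fields used here.
sample = {
    "org": [
        {"affno": "1", "name": "Org A"},
        {"affno": "2", "name": "Org B"},
    ],
    "person": [
        {"affno": "1", "name": "Ann"},
        {"affno": "2", "name": "Bob"},
        {"affno": "", "name": "Cy"},
    ],
}
links = affiliations_by_person(sample)
```

If a data provider encodes several affiliations in one affno value, the exact-match lookup above would need to be adapted.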
Note: For the clustered version of the JSON records:
1) The following fields have been removed:
- bfi_cno, bfi_pno, bfi_sno, bfi_subject (obsolete)
- inetpub (not relevant)
2) The following fields have changed from strings to arrays of strings:
- cuid, filters/mra, filters/rtype, filters/src, mra, noi, oai/mra, search/PST, sid, src
3) The following fields have been added:
- contributors : an array of the most representative list of authors
- filters/class : with the value merged, this indicates that the record is the result of merging two or more records.
- ids : list of IDs from the original records
- org/key : a key used in merging organisations, this is an internal field which will be removed in the future.
- org/namevar : name variants from original records
- org/used : an internal field which like org/key will be removed in the future
- person/key : same as org/key
- person/rank : rank in the original record, mostly internal but could be useful.
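Because some fields are strings in the single records and arrays of strings in the clustered records, code that reads both versions may want to normalise them first. The following Python sketch shows one way to do that. The helper names as_list and get_path are hypothetical, the slash-separated paths simply mirror the notation used in the list above, and the sample values are made up.

```python
def get_path(record, path):
    """Follow a slash-separated path such as "filters/src" into a record.

    Returns None when any part of the path is missing.
    """
    node = record
    for key in path.split("/"):
        if not isinstance(node, dict) or key not in node:
            return None
        node = node[key]
    return node


def as_list(value):
    """Return value as a list, whether it was a string, a list or missing."""
    if value is None or value == "":
        return []
    if isinstance(value, list):
        return value
    return [value]


# Hypothetical single and clustered records, reduced to one field.
single = {"filters": {"src": "dtu"}}
clustered = {"filters": {"src": ["dtu", "ku"]}}

single_sources = as_list(get_path(single, "filters/src"))
clustered_sources = as_list(get_path(clustered, "filters/src"))
```

With this in place, the same downstream code can loop over filters/src, mra, sid and the other affected fields without checking which version of the extract it came from.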
Note 2: If you notice something in the JSON that does not match the description above, please report it to nora.info@dtu.dk and we
will do our best to quickly fix either the JSON or the documentation.
JSON Array
If you have enough memory to parse the whole JSON at once, you can use the following extracts which are formatted as JSON arrays of records.
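As a rough illustration, a JSON array extract can be read in one call with Python's standard json module. This sketch assumes the downloaded file is plain, uncompressed UTF-8 JSON. The function name is hypothetical, and the path is whatever local name you saved the extract under.

```python
import json


def load_records(path):
    """Load a whole JSON array extract into memory as a list of dicts."""
    with open(path, encoding="utf-8") as handle:
        records = json.load(handle)
    if not isinstance(records, list):
        raise ValueError("expected a JSON array of records")
    return records
```

The whole array is held in memory at once, so this approach only suits extracts that fit comfortably in RAM.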
JSON Record List
For easier parsing with more limited memory, you can use the following extracts which are formatted as lists of JSON records, one per line.
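A record list can be processed one line at a time, so only a single record is in memory at any moment. The Python sketch below assumes a plain, uncompressed UTF-8 file with exactly one JSON record per line. The function names are hypothetical, and the year field is taken from the field description above.

```python
import json
from collections import Counter


def iter_records(path):
    """Yield records one by one from a file with one JSON record per line."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines, if any
                yield json.loads(line)


def count_by_year(path):
    """Count records per publication year without loading the whole file."""
    counts = Counter()
    for record in iter_records(path):
        counts[record.get("year", "")] += 1
    return counts
```

Because iter_records is a generator, filters and aggregations can be chained on top of it while memory use stays roughly constant, whatever the size of the extract.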