Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Code used to produce terms list in the work "NLP-Driven Electron Microscopy Ontology Development"

Metadata Updated: July 12, 2025

This is a collection of code written by Maurice Curran that was used to process the Microscopy and Microanalysis conference proceeding corpus into word products described in the publication "NLP-Driven Electron Microscopy Ontology Development". The scripts are written in Python, to be used in the following order:1. SettingUpTextFiles.py and CopyingText.py to get the raw text files; 2. SentenceConversion.py; 3. reference_remover.py; 4. testing.py and testingavg.py; 5. SentenceCreator.py; 6. matscholar_model.py to get matscholar tags; 7. training_model_gensim.py to get gensim model;8. word2vecscript.py and gensim_visual.py;

Access & Use Information

Public: This dataset is intended for public access and use. License: See this page for license information.

Downloads & Resources

References

https://doi.org/10.1007/s40192-024-00378-y

Dates

Metadata Created Date September 11, 2024
Metadata Updated Date July 12, 2025
Data Update Frequency irregular

Metadata Source

Harvested from NIST

Additional Metadata

Resource Type Dataset
Metadata Created Date September 11, 2024
Metadata Updated Date July 12, 2025
Publisher National Institute of Standards and Technology
Maintainer
Identifier ark:/88434/mds2-3198
Data First Published 2024-09-05
Language en
Data Last Modified 2021-12-31 00:00:00
Category Information Technology:Data and informatics, Materials:Modeling and computational material science, Materials:Materials characterization
Public Access Level public
Data Update Frequency irregular
Bureau Code 006:55
Metadata Context https://project-open-data.cio.gov/v1.1/schema/data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id db9c03dc-92ac-4197-988e-5d48281cdbe0
Harvest Source Id 74e175d9-66b3-4323-ac98-e2a90eeb93c0
Harvest Source Title NIST
Homepage URL https://data.nist.gov/od/id/mds2-3198
License https://www.nist.gov/open/license
Program Code 006:045
Related Documents https://doi.org/10.1007/s40192-024-00378-y
Source Datajson Identifier True
Source Hash 4d49de9ba151d6163286ebf8918a4950a9a5bef84953c26c6001e7b87bb48600
Source Schema Version 1.1

Didn't find what you're looking for? Suggest a dataset here.