Skip to main content
U.S. flag

An official website of the United States government

Code used to produce terms list in the work "NLP-Driven Electron Microscopy Ontology Development"

Published by National Institute of Standards and Technology | National Institute of Standards and Technology | Catalog Last Checked: August 02, 2025 at 03:33 PM | Dataset Last Updated: December 31, 2021
This is a collection of code written by Maurice Curran that was used to process the Microscopy and Microanalysis conference proceeding corpus into word products described in the publication "NLP-Driven Electron Microscopy Ontology Development". The scripts are written in Python, to be used in the following order:1. SettingUpTextFiles.py and CopyingText.py to get the raw text files; 2. SentenceConversion.py; 3. reference_remover.py; 4. testing.py and testingavg.py; 5. SentenceCreator.py; 6. matscholar_model.py to get matscholar tags; 7. training_model_gensim.py to get gensim model;8. word2vecscript.py and gensim_visual.py;

Resources

1 resource available

  • NLP code to produce words about electron microscopy

    APPLICATION/ZIP

Find Related Datasets

Search by Tags

Click any tag below to search for similar datasets

data.gov

An official website of the GSA's Technology Transformation Services

Looking for U.S. government information and services?
Visit USA.gov