NYC STEW-MAP Staten Island organizations' website hyperlink webscrape
Resources
4 resources available:
- NYC Staten Island STEW-MAP hyperlink webscrape_README.txt (TEXT/PLAIN)
- NYC Staten Island STEW_MAP hyperlink webscrape reduced network dataset_revised.csv (TEXT/CSV)
- NYC Staten Island STEW_MAP hyperlink webscrape_all.csv (APPLICATION/VND.MS-EXCEL)
- NYC Staten Island STEW_MAP hyperlnk webscrape reduced network dataset_Metadata.txt (TEXT/PLAIN)
Complete Metadata
| @type | dcat:Dataset |
|---|---|
| accessLevel | public |
| bureauCode | [ "020:00" ] |
| contactPoint | { "fn": "Jesse Sayles", "hasEmail": "mailto:sayles.jesse@epa.gov" } |
| description | The data represent a web scrape of hyperlinks from a selection of environmental stewardship organizations identified in the 2017 NYC Stewardship Mapping and Assessment Project (STEW-MAP) (USDA 2017). There are two datasets: 1) the original scrape containing all hyperlinks within the websites and associated attribute values (see "README" file); 2) a cleaned and reduced dataset formatted for network analysis. For dataset 1: Organizations were selected from the 2017 NYC STEW-MAP (USDA 2017), a publicly available spatial dataset about environmental stewardship organizations working in New York City, USA (N = 719). To create a smaller, more manageable sample, all organizations that intersected (i.e., worked entirely within or overlapped) the NYC borough of Staten Island were selected as a geographically bounded sample. Only organizations with working websites that the web scraper could access were retained for the study (n = 78). The websites were scraped between 09 and 17 June 2020 to a maximum search depth of ten using the snaWeb package (version 1.0.1, Stockton 2020) in the R computational language environment (R Core Team 2020). For dataset 2: The complete scrape results were cleaned, reduced, and formatted as a standard edge-array (node1, node2, edge attribute) for network analysis. See "README" file for further details. References: R Core Team. (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/. Version 4.0.3. Stockton, T. (2020). snaWeb Package: An R package for finding and building social networks for a website, version 1.0.1. USDA Forest Service. (2017). Stewardship Mapping and Assessment Project (STEW-MAP). New York City Data Set. Available online at https://www.nrs.fs.fed.us/STEW-MAP/data/. This dataset is associated with the following publication: Sayles, J., R. Furey, and M. Ten Brink. How deep to dig: effects of web-scraping search depth on hyperlink network analysis of environmental stewardship organizations. Applied Network Science. Springer Nature, New York, NY, 7: 36, (2022). |
| distribution | [ { "title": "NYC Staten Island STEW-MAP hyperlink webscrape_README.txt", "mediaType": "text/plain", "downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1522542/NYC%20Staten%20Island%20STEW-MAP%20hyperlink%20webscrape_README.txt" }, { "title": "NYC Staten Island STEW_MAP hyperlink webscrape reduced network dataset_revised.csv", "mediaType": "text/csv", "downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1522542/NYC%20Staten%20Island%20STEW_MAP%20hyperlink%20webscrape%20reduced%20network%20dataset_revised.csv" }, { "title": "NYC Staten Island STEW_MAP hyperlink webscrape_all.csv", "mediaType": "application/vnd.ms-excel", "downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1522542/NYC%20Staten%20Island%20STEW_MAP%20hyperlink%20webscrape_all.csv" }, { "title": "NYC Staten Island STEW_MAP hyperlnk webscrape reduced network dataset_Metadata.txt", "mediaType": "text/plain", "downloadURL": "https://pasteur.epa.gov/uploads/10.23719/1522542/NYC%20Staten%20Island%20STEW_MAP%20hyperlnk%20webscrape%20reduced%20network%20dataset_Metadata.txt" } ] |
| identifier | https://doi.org/10.23719/1522542 |
| keyword | [ "SNA", "Social network analysis", "decision support tools", "environmental governance", "environmental stewardship", "hyperlink networks", "web-scraping" ] |
| license | https://pasteur.epa.gov/license/sciencehub-license-non-epa-generated.html |
| modified | 2021-05-26 |
| programCode | [ "020:000" ] |
| publisher | { "name": "U.S. EPA Office of Research and Development (ORD)", "subOrganizationOf": { "name": "U.S. Environmental Protection Agency", "subOrganizationOf": { "name": "U.S. Government" } } } |
| references | [ "https://doi.org/10.1007/s41109-022-00472-0" ] |
| rights | null |
| title | NYC STEW-MAP Staten Island organizations' website hyperlink webscrape |
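
The description notes that the reduced dataset is formatted as a standard edge-array (node1, node2, edge attribute). As a rough illustration of how such a file might be read for network analysis, the Python sketch below loads a three-column edge array into a nested dictionary and counts outgoing links per node. The sample rows, the header names, and the helper names `read_edge_array` and `out_degree` are hypothetical; the actual column names and attribute values are documented in the README and metadata files and may differ.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows standing in for the reduced CSV.
# The header names and values here are assumptions, not taken from the dataset.
SAMPLE = """node1,node2,attribute
org-a.example,org-b.example,3
org-a.example,org-c.example,1
org-b.example,org-c.example,2
"""


def read_edge_array(handle):
    """Read a three-column edge array into {source: {target: attribute}}."""
    reader = csv.reader(handle)
    next(reader)  # skip the header row
    graph = defaultdict(dict)
    for row in reader:
        if len(row) < 3:
            continue  # ignore blank or malformed rows
        source, target, attribute = row[0], row[1], row[2]
        graph[source][target] = attribute
    return dict(graph)


def out_degree(graph):
    """Count the distinct targets each source node links to."""
    return {node: len(targets) for node, targets in graph.items()}


graph = read_edge_array(io.StringIO(SAMPLE))
degrees = out_degree(graph)
```

To try this on the real file, one could replace `io.StringIO(SAMPLE)` with an opened file handle, for example `open(path, newline="")`, after checking the column order against the metadata file.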