115th U.S. Congress Member Website (Full JavaScript-enabled Scrape) Collection
Creators
Description
This data set represents a point-in-time full JavaScript-enabled scrape of all available 115th U.S. Congress member web sites. The data collection originated and completed on 2018-04-13 and the results are in ndjson/jsonlines/streaming JSON format. File format information is in the enclosed README.md file.
The data was used to evaluate the privacy profiles of each U.S. Congress members' official (.gov hosted) websites for the discussion in <https://rud.is/b/2018/04/13/does-congress-really-care-about-your-privacy/>.
ScrapingHub's "Splash" platform (<https://github.com/scrapinghub/splash>) was used along with the "splashr" R package (<https://github.com/hrbrmstr/splashr>) to retrieve the content.
Files
LICENSE.txt
Additional details
Related works
- Is referenced by
- https://archive.org/services/purl/purl/hrbrmstr/2018-04-13-congress-privacy (URL)