Data scraping of a static website with three levels of depth. The final product is to be delivered as a table, CSV, or other format.
## Deliverables
Request for Proposal: Data Scraping
We require some data scraping work to be completed by November 15th, 2009.
We require scraping of the following site and sub-pages:
[login to view URL]
Fields to scrape:
NPRI ID
Company Name
City
Province
Using the first company (Vanderhoof Speciality Wood Products) as an example:
First Sub-Page to scrape:
[login to view URL]
Fields to scrape and place in table:
Substance Column (all rows)
CAS Number Column (all rows)
This Facility Column (all rows)
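As a rough illustration of pulling named columns out of a table on a page like this, the sketch below uses only Python's standard `html.parser`. It assumes the sub-page presents the data as a plain HTML table whose first row holds the column headers; the sample markup, the exact header spellings, and the helper names `TableRows` and `select_columns` are hypothetical and would need adjusting to the real page.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # completed rows, each a list of cell strings
        self._row = None    # cells of the row being read, or None
        self._cell = None   # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Collapse internal whitespace so cell values compare cleanly.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            if self._row:
                self.rows.append(self._row)
            self._row = None
            self._cell = None


def select_columns(html, wanted):
    """Return one dict per data row, keeping only the wanted column headers."""
    parser = TableRows()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    index = {name: header.index(name) for name in wanted if name in header}
    return [
        {name: row[i] for name, i in index.items() if i < len(row)}
        for row in body
    ]


# Made-up sample markup; the real page layout has not been inspected.
SAMPLE = """
<table>
  <tr><th>Substance</th><th>CAS Number</th><th>This Facility</th><th>Other</th></tr>
  <tr><td>Example substance A</td><td>000-00-0</td><td>1.5</td><td>x</td></tr>
  <tr><td>Example substance B</td><td>111-11-1</td><td>0.2</td><td>y</td></tr>
</table>
"""

rows = select_columns(SAMPLE, ["Substance", "CAS Number", "This Facility"])
```

Selecting columns by header text rather than by position keeps the extraction stable if the site shows extra columns on some pages.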
Second Sub-Page to scrape:
[login to view URL]
Fields to scrape and place in table:
Street and Number
Street 2
City
Province
Postal Code
Number of employees
Website
Parent Company Information if applicable
Contact Information:
Name
Positions
Phone
Fax
Email
Standard Industrial Classifications
Pollution Prevention Planning
Comments
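
One possible shape for the delivered CSV is sketched below, assuming one output row per facility and substance, with the facility fields repeated on each row. The column names mirror the field lists above; the function names `flatten` and `to_csv` and all sample values are placeholders, and the second sub-page fields (address, contact, and so on) are left out for brevity but could be appended to the facility columns in the same way.

```python
import csv
import io

# Column names taken from the field lists in this request.
FACILITY_FIELDS = ["NPRI ID", "Company Name", "City", "Province"]
SUBSTANCE_FIELDS = ["Substance", "CAS Number", "This Facility"]


def flatten(facility, substances):
    """Yield one flat row per substance, repeating the facility fields."""
    base = {name: facility.get(name, "") for name in FACILITY_FIELDS}
    # A facility with no substances still produces one row with blanks.
    for substance in substances or [{}]:
        row = dict(base)
        row.update({name: substance.get(name, "") for name in SUBSTANCE_FIELDS})
        yield row


def to_csv(records):
    """Render (facility, substances) pairs as CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=FACILITY_FIELDS + SUBSTANCE_FIELDS,
        lineterminator="\n",
    )
    writer.writeheader()
    for facility, substances in records:
        writer.writerows(flatten(facility, substances))
    return buffer.getvalue()


# Placeholder values only; none of these come from the real site.
records = [
    (
        {"NPRI ID": "0000", "Company Name": "Example Company", "City": "Example City", "Province": "XX"},
        [
            {"Substance": "Example substance A", "CAS Number": "000-00-0", "This Facility": "1.5"},
            {"Substance": "Example substance B", "CAS Number": "111-11-1", "This Facility": "0.2"},
        ],
    ),
    ({"NPRI ID": "0001", "Company Name": "Another Example", "City": "Elsewhere", "Province": "YY"}, []),
]

csv_text = to_csv(records)
```

A single flat file is the simplest to open in a spreadsheet; if repeated facility data is unwanted, the same records could instead be written to separate facility and substance files joined on NPRI ID.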