I need a script that pulls contact information (name, e-mail address, and phone number) from websites. I have links to 50 high-level sites, each of which links to the websites of specific organizations; the contact information must be pulled from those organization sites. The script needs to do the following:
1) Check whether each site has a robots.txt. If one exists, skip the site and log it.
2) Visit each sub-organization's website (linked from the 50 high-level sites) and grab the contact information for specific contacts/titles (you can pull all available contacts if that is easier).
3) Record as much information as possible about each contact, including title, organization name, address, e-mail, phone, etc.
4) Load the contacts into CSV files: one per high-level site, 50 in total.
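To illustrate steps 1, 3, and 4, here is a minimal Java sketch rather than a required design. The class name `ContactSketch`, its method names, and both regular expressions are assumptions made for illustration: the e-mail pattern is deliberately simplified, and the phone pattern only covers common North American formats. The robots.txt check treats an HTTP 200 response as "file exists", which is also an assumption.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class ContactSketch {
    // Simplified e-mail pattern (assumption): does not cover every valid address.
    private static final Pattern EMAIL =
        Pattern.compile("[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\\.[A-Za-z]{2,}");

    // Simplified phone pattern (assumption): 555-123-4567, (555) 123-4567, 555.123.4567.
    private static final Pattern PHONE =
        Pattern.compile("\\(?\\d{3}\\)?[-. ]?\\d{3}[-. ]\\d{4}");

    // Step 1: returns true if <siteRoot>/robots.txt answers with HTTP 200.
    // siteRoot is expected without a trailing slash, e.g. "https://example.org".
    static boolean hasRobotsTxt(String siteRoot) throws Exception {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request =
            HttpRequest.newBuilder(URI.create(siteRoot + "/robots.txt")).GET().build();
        HttpResponse<Void> response =
            client.send(request, HttpResponse.BodyHandlers.discarding());
        return response.statusCode() == 200;
    }

    // Collects every distinct match of the pattern, in order of first appearance.
    private static List<String> findAll(Pattern pattern, String text) {
        Set<String> found = new LinkedHashSet<>();
        Matcher matcher = pattern.matcher(text);
        while (matcher.find()) {
            found.add(matcher.group());
        }
        return new ArrayList<>(found);
    }

    // Step 3: pull e-mail addresses and phone numbers out of page text.
    static List<String> extractEmails(String text) {
        return findAll(EMAIL, text);
    }

    static List<String> extractPhones(String text) {
        return findAll(PHONE, text);
    }

    // Step 4: quote a field if it contains a comma, quote, or line break.
    static String csvField(String value) {
        if (value == null) {
            return "";
        }
        boolean needsQuotes = value.contains(",") || value.contains("\"")
            || value.contains("\n") || value.contains("\r");
        String escaped = value.replace("\"", "\"\"");
        return needsQuotes ? "\"" + escaped + "\"" : escaped;
    }

    // Joins fields into one CSV line (no trailing line break).
    static String csvRow(String... fields) {
        StringBuilder row = new StringBuilder();
        for (int i = 0; i < fields.length; i++) {
            if (i > 0) {
                row.append(',');
            }
            row.append(csvField(fields[i]));
        }
        return row.toString();
    }

    public static void main(String[] args) {
        // Made-up sample text standing in for a fetched contact page.
        String page = "Jane Doe, Director. Email: jane.doe@example.org Phone: (555) 123-4567";
        for (String email : extractEmails(page)) {
            System.out.println(csvRow("Doe, Jane", "Director", email,
                String.join("; ", extractPhones(page))));
        }
    }
}
```

In a full script, `hasRobotsTxt` would be called once per site before any page is fetched, and the rows for each high-level site would be written to that site's own CSV file.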
This can be developed in Perl, Java, or C++, and it needs to be completed within 3 weeks. If anything is unclear, I am happy to answer questions.