
Web crawler app needed! Funds already held in escrow. Online projects, offline negotiation, arranged by 智城

Client : Dorothy robinson Status : Bidding closed
Project ID : 97014
Budget : $1,000-5,000
Development period : 7 days
Skills : Java Perl C
Date posted : 2010-02-11

Description

I need a script that pulls contact information (name, e-mail address, and phone number) from websites. I have links to 50 high-level sites, each of which links to the sites of specific organizations; the contact information must be pulled from those organization sites. The script has several functions:

1) Check whether the site has a robots.txt. If it does, skip the site and log it.

2) Go to each sub-organization's website (linked from the 50 high-level sites) and grab the contact information for specific contacts/titles (you can pull all available contacts if that is easier).

3) Record as much information as possible about each contact, including title, organization name, address, e-mail, phone, etc.

4) Load the contacts into CSV files, one for each of the 50 high-level sites (50 CSVs in total).
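
To make step 1 concrete, here is a minimal Java sketch of the robots.txt check. It is only an illustration under stated assumptions: the class and method names (RobotsCheck, robotsUri, existsForStatus, hasRobotsTxt) are hypothetical, an HTTP 200 response is assumed to mean "robots.txt exists", and the site URLs are passed as command-line arguments. As the requirement is worded, the mere presence of the file causes the site to be skipped; the file's rules are not parsed.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;
import java.util.ArrayList;
import java.util.List;

class RobotsCheck {
    // Build the robots.txt URL for the host of any page URL on that site.
    static URI robotsUri(String siteUrl) {
        URI site = URI.create(siteUrl);
        return URI.create(site.getScheme() + "://" + site.getAuthority() + "/robots.txt");
    }

    // Assumption: only a 200 response counts as "robots.txt exists".
    static boolean existsForStatus(int statusCode) {
        return statusCode == 200;
    }

    // Fetch robots.txt (body discarded) and report whether the site has one.
    static boolean hasRobotsTxt(HttpClient client, String siteUrl) throws Exception {
        HttpRequest request = HttpRequest.newBuilder(robotsUri(siteUrl))
                .timeout(Duration.ofSeconds(10))
                .GET()
                .build();
        HttpResponse<Void> response =
                client.send(request, HttpResponse.BodyHandlers.discarding());
        return existsForStatus(response.statusCode());
    }

    public static void main(String[] args) throws Exception {
        HttpClient client = HttpClient.newBuilder()
                .followRedirects(HttpClient.Redirect.NORMAL)
                .build();
        List<String> skipped = new ArrayList<>();
        for (String site : args) {
            if (hasRobotsTxt(client, site)) {
                // Step 1: ignore the site and log it.
                skipped.add(site);
                System.err.println("SKIPPED (robots.txt present): " + site);
            }
        }
        System.out.println(skipped.size() + " of " + args.length + " sites skipped");
    }
}
```

This sketch needs Java 11 or later for java.net.http. A real crawler would probably also want retries and a log file rather than standard error.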

This can be developed in Perl, Java, or C++, and it needs to be completed within a maximum of 3 weeks. If anything is unclear, I am happy to answer questions.
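
For steps 2 to 4, one possible shape of the extraction and CSV output in Java is sketched below. Everything in it is an assumption made for illustration: the class name ContactExtractor, the simplified e-mail pattern, the North American phone format, and the sample page text. It works on page text that has already been downloaded and does not attempt to find names, titles, or addresses, which would need site-specific logic.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class ContactExtractor {
    // Simplified e-mail pattern; it misses obfuscated forms like "name at example dot org".
    private static final Pattern EMAIL =
            Pattern.compile("[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\\.[A-Za-z]{2,}");

    // Assumption: North American style numbers, e.g. (555) 123-4567 or 555-123-4567.
    private static final Pattern PHONE =
            Pattern.compile("\\(?\\d{3}\\)?[-. ]?\\d{3}[-. ]\\d{4}");

    // Return every distinct match of the pattern, in order of first appearance.
    static List<String> findAll(Pattern pattern, String text) {
        Set<String> found = new LinkedHashSet<>();
        Matcher m = pattern.matcher(text);
        while (m.find()) {
            found.add(m.group());
        }
        return new ArrayList<>(found);
    }

    static List<String> emails(String text) { return findAll(EMAIL, text); }

    static List<String> phones(String text) { return findAll(PHONE, text); }

    // Quote a field when it contains a comma, a double quote, or a line break.
    static String csvField(String value) {
        if (value.contains(",") || value.contains("\"")
                || value.contains("\n") || value.contains("\r")) {
            return "\"" + value.replace("\"", "\"\"") + "\"";
        }
        return value;
    }

    // Join the escaped fields into one CSV line (without the line terminator).
    static String csvRow(String... fields) {
        List<String> escaped = new ArrayList<>();
        for (String f : fields) {
            escaped.add(csvField(f));
        }
        return String.join(",", escaped);
    }

    public static void main(String[] args) {
        // Hypothetical page text standing in for a downloaded contact page.
        String page = "Jane Doe, Director. Phone: (555) 123-4567. E-mail: jane.doe@example.org";
        System.out.println(csvRow("Jane Doe", "Director", "Example Org, Inc.",
                emails(page).get(0), phones(page).get(0)));
    }
}
```

Running main prints one row with the organization name quoted because it contains a comma: Jane Doe,Director,"Example Org, Inc.",jane.doe@example.org,(555) 123-4567. In a full script, rows like this would be appended to the CSV file belonging to the high-level site the organization was reached from.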
