所有软件外包项目 Gray arrow bg Mlt.thr spider with asp.net c#

Mlt.thr spider with asp.net c# 资金已经托管 线上项目,线下洽谈,智城安排

发包方 : Sandra walker 接包方 : Elance_jon 状态 :完成
项目编号 : 95822
项目预算 : $1,000-5,000
开发周期 : 7 天
技能 : ASP C# Search C
发布日期 : 2010-01-23

描述

I need an asp.net application using C# to crawl preset web pages and extract specific content from the source of each of those pages.


Frontend:


A web page with a textbox and a submit button. When the user clicks submit, you will crawl the sites(URLs will be provided) and extract the results. source code of the web pages must be read using HttpWebRequest and HttpWebResponse. This process should be multi threaded and the results should be displayed as they are processed. The processing/wait icon must be displayed while the results are being awaited. The html sources of webpages are saved on the server. Once the web page is read, the required content must be extracted using specific rules for each page. There are 6 pages to read per search. The result must be cached on the server for 24 hours and a URL rewrite engine must be configured such that pages are cached in this format - http://yourserver.com/keyword/



I would prefer that you use an already existing crawler like searcharoo or zeta web spider ( http://www.codeproject.com/KB/aspnet/ZetaWebSpider.aspx ) or this http://www.vsj.co.uk/articles/display.asp?id=402. Please mention in PMB as to how you plan to do this. You will need a test server online to show me the work before i make the payment.


竞标

请您先登录,然后提交此项目的竞标方案。
还不是智城用户? 智城期待您的加入,请注册成为我们的一员吧!
Project ad2