Content scraper with url crawling (PHP and crons)

Tildelt Lagt ut May 20, 2012 Betales ved levering
Tildelt Betales ved levering

I need a couple of content scrapers based on PHP that will work with automatic crons. The functionalities of these scrapers must be at lease:

1. Easy to manage: in an easy administrator overview dashboard I can add the url that must be scraped. Now I can add the classes on the site that I will scrape and select from a drop down menu in what database colomn I will add the content.

* For example: I will scrape the title of a site article. For this I add the class 'class=title' and select the database colomn 'colomn1'.

* Multiple colomns (at lease 10) must be available

2. Automatic scan the whole website: when I like that the script will crawl the whole website I can select this option in the administrator panel. The script will now crawl every url that correspondent with the url I have submit, and check if the there is corresponding data to add to my database.

* Check for dubble entries: the script check for dubble entries. Dublicated content will not be placed in the database.

* Add the url to the database: the urls where the data is scraped must be added to every row of content in the database so the script can check if the url is already crawled.

* Check daily for new updates: the script can check the sites daily for new content. So when the URL has new articles on the site or new products, the script will automatic pick these items and add the content to my database.

* Dates: the database must include a date column so I can see when the data is scraped.

* Use random IP addresses and automatically fill this: the system must have a separated database for IP addresses. A script must update the IP list daily and scrape new IP addresses from websites. Also open source IP website should be used to update the list.

○ The scraping scripts must use the random IP addresses to scrape data.

My budget is low so don't bid more than the project budget. Also, only bid and send PM when you can do the job in max. 7 days.

NOTE: Only bid when you have read the project details. Don't send messages with all kinds of example links you build before that are not relevant. Only send project relevant messages or I will report the messages as spam.

Thanks already for the replies. Hope to find a long relationship development partner.

Best regards

Grafisk design HTML MySQL PHP Nettsidedesign

Prosjekt-ID: #1645861

Om prosjektet

8 bud Eksternt prosjekt Aktiv May 21, 2012

8 frilansere byr i gjennomsnitt €196 for denne jobben

SigmaVisual

We can help in your project, please check PMB and our ratings/reviews to get idea of our experience.

€225 EUR på 4 dager
(249 Omtaler)
7.8
srinichal

I am an expert in crawlers and can deliver the script

€250 EUR på 6 dager
(88 Omtaler)
7.0
phpXpertbd

I specialize in similar projects. Please check PM for more details.

€250 EUR på 10 dager
(19 Omtaler)
5.4
tamrakar81

thanks for invitation . We can do this in C# . Regards,

€200 EUR på 3 dager
(38 Omtaler)
5.1
raul27868

Hello, I can do this work for you and I'm ready to start. Please see pmb for details. Regards Raul

€120 EUR på 7 dager
(8 Omtaler)
4.5
capricetech

Hi, Thanks for your invitation . We are very keen to accomplish your project. Please check your Personal message board for in details discussion .

€250 EUR på 7 dager
(9 Omtaler)
4.4
bappyiub80386

Sir I am expert in website scrapping. Did many jobs in website scrapping.

€30 EUR på 10 dager
(3 Omtaler)
2.9
mediabeams

Top professionals mediabeams are ready. please check your PMB. regards teams mediabeams.

€245 EUR på 10 dager
(3 Omtaler)
2.5