Need help in file search engine crawler, PHP Web Programming

Assignment Help:

Need help in File search engine crawler

Want a web crawler to gather data on an continuous basis from different file hosting services and filter it.

1) Data will be used to provide a files search engine to users. I don't want a clone of any similar kind website but infect something better than that one.

2) Process of data get-together will be a continuous process and will be running on the servers 24x7.

3) Data must be properly filtered to erase duplicate entries and new entries of the same data should merge.

I want the crawler itself part of the websites

It has to crawl multiple websites through given xml feed. Crawler must detect all links to files on rapidshare.com, 4shared, uploading as well as all other file sharing hosts. After detection it must add those links found to some kind of database (prefer MySQL) with addition information found about file Like Meta description, size, title, date if available.

Desired Skills are PHP5, cURL, MySQL Programming


Related Discussions:- Need help in file search engine crawler

E-commerce platform on website app, E-Commerce platform on website App H...

E-Commerce platform on website App Hello creative web developers as well as programmers. I am observing to create a new fashion / website / platform, I want it to be connecte

Want to know php and mysql and be able to debug, Update PHP code to work wi...

Update PHP code to work with 5.4+ Want to know PHP and MySQL and be able to debug and solve errors. We have an old CRM engine developed back in '06/07 as well as no longer wo

We are seeking of joomla template conversion, We are seeking of Joomla Temp...

We are seeking of Joomla Template Conversion I need to convert the template into Joomla format. The deliverables here are- a) The ENTIRE template including all the pages w

We need a chief technical expert, We need a Chief Technical expert I am ...

We need a Chief Technical expert I am looking for a developer expert for a new digital book publishing start-up. As you may know the publishing industry is changing rapidly as w

Protection against suspicious data, Input values embedded in SQL statements...

Input values embedded in SQL statements should be screened for inappropriate characters that can form the basis so-called SQL Injection attacks, a type of security attack that may

Website creation and scraping, I have a three stage project related to prep...

I have a three stage project related to preparing a website: 1) Scraping data from few websites. I will identify which ones and which data once your selected. 2) Modifying th

Need help in a rent a snow scooter site, Rent a snow scooter site I need...

Rent a snow scooter site I need an html/css and php developer for a project of snow scooter rental site. Entirely graphical design of subpages will be provided. I only want a pr

Need help for turn a paper form into an html5 form, Need help for Turn a pa...

Need help for Turn a paper form into an HTML5 form I have a paper order-sheet that desires to be transformed into a responsive html5 form that submits to order.do.php. The field

How can images be optimized for use on web pages, Question: (a) Write ...

Question: (a) Write the HTML tags for the following text: Software List A. Module 1 • Adobe Illustrator CS3 B. Module 2 • Adobe Photoshop CS3 C. Module 3 • Maya •

Dns server, Several computers linked to the Internet host part of the DNS d...

Several computers linked to the Internet host part of the DNS database & the software which allows others to access it. These all computers are known DNS servers. No DNS server has

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd