Web-crawling & scraping of Interviews with film industry professionals
$250-750 CAD
Betales ved levering
This project involves locating interviews with particular film industry professionals (directors, producers and actors/actresses) from a defined list of websites/magazines/newspapers, scraping the text of each interview and storing it in a separate text file (using the following naming convention: [personID][interview number (001-xxx)].txt).
At the same time, you will keep track of the interviews you found in a master list (columns are: Interview number, url of where the interview is located, date of the interview [if available]). You are asked to collect a minimum of 10 interviews per person available from a shortlist of sources.
The project will consist of the following steps for each list of persons:
1. Determine method of access to the intended data sources (websites/magazines/newspapers), for which we will provide a list of 10.
2. Query these sources for the persons on the list, examine if interview contains evidence of the interviewee being quoted (quotation marks in combination with prose, name in combination with verb indicative of speech) and scrape the interview if it meets the aforementioned criterion. Save the speech parts of the interview in a secondary file.
3. Supplement where necessary with top hits in a Google search (name person + interview), determine which of these are from sources not included in the list used for step 1, and execute Step 2 on the additional sources found until a sufficient number of interviews per person is reached.
Lists include:
1. Directors: 306 (overlaps with producers for 110 professionals)
2. Producers: 418 (overlaps with directors for 110 professionals)
3. Actors/actresses: 697
The person taking this job has experience with web crawlers and text scraping, can work with a wide range of source material for text scraping, has a proactive attitude and is a creative problem-solver.
If this is you, we look forward to your application.
Prosjekt-ID: #10421921
Om prosjektet
18 frilansere byr i gjennomsnitt $466 for denne jobben
We are a team (19 operators) here, giving all data entry, research and scraping service world wide with best quality output ,gone through your project description, we are experienced enough to collect the data from sev Mer
Wide experience in Research and Scraping.I have gone through from your job posting and i would like to discuss few questions that i have
hello, sir: c/c++/python expert worked for samsung & huawei maybe more details will be helpful a sample can be provided before hired. hope to get message from u ty
Hello, I am computer science grad and had done many scrapping projects. let me know the URL. i can start now.
Hey, this seems like a very interesting project. I can help you with this with a powerful took which I've designed for these types of projects. Specifically to extract the text from a source and send it to a second fil Mer