Web-crawling & scraping of Interviews with film industry professionals

Lukket Lagt ut May 5, 2016 Betales ved levering
Lukket Betales ved levering

This project involves locating interviews with particular film industry professionals (directors, producers and actors/actresses) from a defined list of websites/magazines/newspapers, scraping the text of each interview and storing it in a separate text file (using the following naming convention: [personID][interview number (001-xxx)].txt).

At the same time, you will keep track of the interviews you found in a master list (columns are: Interview number, url of where the interview is located, date of the interview [if available]). You are asked to collect a minimum of 10 interviews per person available from a shortlist of sources.

The project will consist of the following steps for each list of persons:

1. Determine method of access to the intended data sources (websites/magazines/newspapers), for which we will provide a list of 10.

2. Query these sources for the persons on the list, examine if interview contains evidence of the interviewee being quoted (quotation marks in combination with prose, name in combination with verb indicative of speech) and scrape the interview if it meets the aforementioned criterion. Save the speech parts of the interview in a secondary file.

3. Supplement where necessary with top hits in a Google search (name person + interview), determine which of these are from sources not included in the list used for step 1, and execute Step 2 on the additional sources found until a sufficient number of interviews per person is reached.

Lists include:

1. Directors: 306 (overlaps with producers for 110 professionals)

2. Producers: 418 (overlaps with directors for 110 professionals)

3. Actors/actresses: 697

The person taking this job has experience with web crawlers and text scraping, can work with a wide range of source material for text scraping, has a proactive attitude and is a creative problem-solver.

If this is you, we look forward to your application.

Dataregistrering Data Mining Databehandling Nett-scraping Nettsøk

Prosjekt-ID: #10421921

Om prosjektet

18 bud Eksternt prosjekt Aktiv 7 år siden

18 frilansere byr i gjennomsnitt $466 for denne jobben

Marie1234

A proposal has not yet been provided

$315 CAD på 1 dag
(275 Omtaler)
7.5
diamond247

We are a team (19 operators) here, giving all data entry, research and scraping service world wide with best quality output ,gone through your project description, we are experienced enough to collect the data from sev Mer

$400 CAD på 10 dager
(255 Omtaler)
7.2
Verz1Lka

Hello! I'm web scraping expert and i can done your project. I use python language and scrapy framework. My scripts works on windows, mac or linux, but linux is preferably. I can schedule scripts on server if it is req Mer

$399 CAD på 10 dager
(107 Omtaler)
6.5
seoguru17

Wide experience in Research and Scraping.I have gone through from your job posting and i would like to discuss few questions that i have

$333 CAD på 10 dager
(95 Omtaler)
6.3
vlayausa

Hello, I am interested in this project. I am looking forward to working on it, because it is connected with film industry, and I love watching movies and I know a lot about actors, directors and other things... Mer

$300 CAD på 7 dager
(93 Omtaler)
5.5
sylar1015

hello, sir: c/c++/python expert worked for samsung & huawei maybe more details will be helpful a sample can be provided before hired. hope to get message from u ty

$400 CAD på 10 dager
(16 Omtaler)
4.7
jinigo23

Hi there! I've good knowledge and experience in this kind of work. I will complete your project within the time stated. I am ready to start the work immediately. I will do my best. I've added some of my previous projec Mer

$340 CAD på 10 dager
(10 Omtaler)
5.0
Elsa22

Hello, I am a freelancer from Sweden. I have done similar projects on data entry and Web scrapping. Moreover I have got good feedbacks too. So if you are willing to hire me I will assure to do a quality work as I have Mer

$500 CAD på 10 dager
(13 Omtaler)
4.4
dghq123

Hello, I am computer science grad and had done many scrapping projects. let me know the URL. i can start now.

$555 CAD på 10 dager
(4 Omtaler)
4.0
mike199

My name is Mike and I’m from UK. I work with individual clients and also provide outsourcing services for a number of UK and USA based agencies. Your project description sounds interesting to me and I do have skills & Mer

$555 CAD på 10 dager
(0 Omtaler)
0.0
SharePointExper

Hey, this seems like a very interesting project. I can help you with this with a powerful took which I've designed for these types of projects. Specifically to extract the text from a source and send it to a second fil Mer

$600 CAD på 4 dager
(0 Omtaler)
0.0
Shopify

I want to discuss this project with you further, let me know the best suitable time for you to schedule the meeting, Feel free to message me at any time, i used to be online 14 hrs in a day on this website so probably Mer

$773 CAD på 20 dager
(0 Omtaler)
3.4
ashanfw

Hi, I'm an IT guy by trade and handle most aspects of IT in general from data entry to network and server administration. I would like to work for myself and on my own terms and shall use my ever growing profession Mer

$388 CAD på 10 dager
(0 Omtaler)
0.0