Using Google spreadsheet to crawl our site

  • Status: Closed
  • Prize: $150
  • Entries received: 7
  • Winner: erdyse

Contest summary

We are looking for the best Google spreadsheet solution for crawling our websites and checking them for specific tags and values. These tags and values must then be compared to a template. Your mission is to provide the best idea for this. The winner will get the assignment to implement the solution for us.

What we need

Your solution should manage up to 1 000 URLs that need to be checked for each of our 9 websites (a total of 11 000 URLs). Each URL needs to be checked to confirm that it has the correct tags. The check needs to be automatic, and we need to receive an alarm in the spreadsheet and by email if the correct tag is not on the page.
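
As a rough sketch of how the email alarm could be assembled, the function below collects every row that did not pass and turns it into a subject and body. The function name `buildAlert`, the row shape `{ url, status }` and the convention that a passing row has the status "OK" are all assumptions made for illustration, not part of the brief.

```javascript
// Builds an alert email from checked rows. Assumes (hypothetically) that each
// row looks like { url: string, status: string } and that "OK" means passed.
function buildAlert(rows) {
  const failures = rows.filter(row => row.status !== 'OK');
  if (failures.length === 0) {
    return null; // nothing to report, so no email is needed
  }
  const lines = failures.map(row => row.url + ' -> ' + row.status);
  return {
    subject: failures.length + ' URL(s) failed the tag check',
    body: lines.join('\n'),
  };
}

// In Google Apps Script the result could then be sent with the built-in
// mail service, for example:
//   const mail = buildAlert(rows);
//   if (mail) MailApp.sendEmail('someone@example.com', mail.subject, mail.body);
// The recipient address above is a placeholder.
```

Keeping the message building separate from the sending means the same rows can drive both the spreadsheet alarm and the email.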

The tags we are checking for are the canonical tag, the meta robots tag and the hreflang tag.
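
To make the three tags concrete, here is a minimal sketch of pulling them out of a page's HTML with regular expressions. The names `getAttr` and `extractSeoTags` are hypothetical, and regex matching is only an approximation of real HTML parsing: it assumes quoted attribute values and may misread unusual markup.

```javascript
// Returns the value of one attribute from a single tag string, or null.
// Assumes the value is wrapped in single or double quotes.
function getAttr(tag, name) {
  const pattern = new RegExp('\\b' + name + '\\s*=\\s*["\']([^"\']*)["\']', 'i');
  const match = tag.match(pattern);
  return match ? match[1] : null;
}

// Collects the canonical URL, the meta robots content and all hreflang
// alternates (keyed by lowercased language code) from raw HTML.
function extractSeoTags(html) {
  const result = { canonical: null, robots: null, hreflang: {} };
  const tags = html.match(/<(?:link|meta)\b[^>]*>/gi) || [];
  for (const tag of tags) {
    const isLink = /^<link/i.test(tag);
    const rel = (getAttr(tag, 'rel') || '').toLowerCase();
    const name = (getAttr(tag, 'name') || '').toLowerCase();
    const lang = getAttr(tag, 'hreflang');
    if (isLink && rel === 'canonical') {
      result.canonical = getAttr(tag, 'href');
    } else if (isLink && rel === 'alternate' && lang) {
      result.hreflang[lang.toLowerCase()] = getAttr(tag, 'href');
    } else if (!isLink && name === 'robots') {
      result.robots = getAttr(tag, 'content');
    }
  }
  return result;
}
```

In Google Apps Script the HTML string could come from `UrlFetchApp.fetch(url).getContentText()`, which keeps everything inside the spreadsheet's script editor.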

There are many ways to set this up, and your mission is to find the best, easiest and most logical way.

Requirements for your solution

> There can be no installation of any software anywhere
> The solution must work directly in the spreadsheet
> If you choose javascript, that too must be directly in the spreadsheet
> We can’t be dependent on any third party
> We want only one column for “Status” and then an IF message that lets us know what the error is
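
The single "Status" column could be filled by a function along these lines. This is a sketch under assumptions: the names `normalizeRobots` and `buildStatus`, the object shapes, and the exact wording of the message are all made up for illustration. It treats the template's hreflang entry as a map from language code to expected URL, so several hreflang tags can be checked at once.

```javascript
// Makes "Index, Follow" and "follow,index" compare as equal.
function normalizeRobots(value) {
  return (value || '')
    .toLowerCase()
    .split(',')
    .map(part => part.trim())
    .filter(Boolean)
    .sort()
    .join(',');
}

// Compares the tags found on a page with the template values and returns
// one status string that names every tag that is wrong or missing.
// Assumed shapes: { canonical: string, robots: string, hreflang: { lang: url } }.
function buildStatus(found, expected) {
  const failed = [];
  if ((found.canonical || '') !== expected.canonical) {
    failed.push('canonical');
  }
  if (normalizeRobots(found.robots) !== normalizeRobots(expected.robots)) {
    failed.push('meta robots');
  }
  const foundLangs = found.hreflang || {};
  const hreflangOk = Object.keys(expected.hreflang).every(
    lang => foundLangs[lang] === expected.hreflang[lang]
  );
  if (!hreflangOk) {
    failed.push('hreflang');
  }
  return failed.length === 0
    ? 'OK'
    : 'ERROR: ' + failed.join(', ') + ' not as expected';
}
```

A row where everything matches shows "OK"; a row with a wrong canonical and a missing hreflang alternate shows "ERROR: canonical, hreflang not as expected".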

Examples

Here are some examples of our URLs for our Swedish site:

https://www.footway.se/skor/herr/kangor-boots
https://www.footway.se/skor/kangor-boots/chelsea-boots
https://www.footway.se/skor/herr/kangor-boots/chelsea-boots

You can also find the URLs and the current setup in this test file:

https://drive.google.com/drive/folders/18g2Pg_Zor1LDRgs_zZyO1Vo0Vk2jU7Na

The solution you provide needs to check these URLs for the correct canonical tag, the correct meta robots tag and the correct hreflang tag, and then return a status message in a single column if all tags are present and correct. If not, the message must say which of the tags are wrong or missing.

It is up to you how to structure and present the solution for this. Keep in mind that there are 1 000 URLs for each of 9 different websites.
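
With this many URLs, checking everything in one go may not be practical; the contest holder notes in the comments that =IMPORTXML did not cope with the load. One possible way around that, sketched here as an assumption rather than a chosen design, is to process the list in fixed-size batches and remember where the last run stopped. The function name `nextBatch`, the batch size of 100 and the example.com URLs are illustrative only.

```javascript
// Returns the URLs to check in this run and the cursor to store for the
// next run. A cursor at or past the end of the list starts a new pass.
function nextBatch(urls, cursor, batchSize) {
  const start = cursor >= urls.length ? 0 : cursor;
  const end = Math.min(start + batchSize, urls.length);
  return { batch: urls.slice(start, end), nextCursor: end };
}

// Simulation: 1 000 placeholder URLs checked 100 at a time.
const urls = Array.from({ length: 1000 }, (_, i) => 'https://example.com/page-' + i);
let cursor = 0;
let runs = 0;
do {
  const step = nextBatch(urls, cursor, 100);
  // In Google Apps Script, step.batch could be fetched here (for example
  // with UrlFetchApp.fetchAll) and the cursor saved with PropertiesService
  // so that a time-driven trigger can resume on the next run.
  cursor = step.nextCursor;
  runs += 1;
} while (cursor < urls.length);
console.log(runs); // prints 10
```

Because each run only touches one batch, a single execution stays short, and a full pass over one site's list simply takes several triggered runs.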

Examples of what we want

A Google spreadsheet solution
A solution with no installation of any software
An easy structure that is quick to understand
Google script
A crawler
Web scraping

Examples of what we don’t want

A programme that needs to be installed
An advanced programme that needs lots of manual work
A subscription to any SEO service
Your own developed programme that requires installation

Make sure you follow instructions closely. Read them again now. And once more after. Do not start writing before you have a clear understanding of the guidelines. We will reject any entry that does not follow instructions. We also reject any entries telling us you're working on the project. For long-term collaboration we highly value the ability to follow instructions.

We have limited capacity to answer questions or give individual feedback during the project. Instead we will give collective feedback, which will hopefully help you get a better understanding of what we are looking for.

The winner gets 150 USD, and an additional 100 USD later on for implementing the solution for us.

Employer feedback

“Victor A presented a clear and precise idea for how to solve our tag checking problem. Great work.”

alexanderaberg, Sweden

Public clarification board

  • brohimumtaz
    • 4 years ago

    Kindly see my portfolio.

  • alexanderaberg
    Contest holder
    • 4 years ago

    I see some recommendations of =IMPORTXML. This is a good idea and we have tried it, but the load from so many URLs becomes too big for the spreadsheet. It does not work with this many URLs.

    If you can provide an idea around this, we'd be very glad!

  • erdyse
    • 4 years ago

    Thank you. Will submit soon.

  • erdyse
    • 4 years ago

    Your sites have multiple hreflang tags, and yet your template has only one expected URL for the hreflang tag to compare to.

    1. alexanderaberg
      Contest holder
      • 4 years ago

      See my answer above.

  • erdyse
    • 4 years ago

    Secondly, your brief does not explain what specifically we are to check for in the meta robots tag.

    1. alexanderaberg
      Contest holder
      • 4 years ago

      See my answer above.

  • alexanderaberg
    Contest holder
    • 4 years ago

    As Victor asked below, we have several hreflang tags, and the same goes for meta robots. Remember that this contest is not about doing the crawling; it is about providing the easiest and best idea of HOW to do this. So don't spend time on the actual crawling now.
