Want to 10x your blog traffic?

4 days to learn the fundamentals of SEO and grow your blog with free, consistent traffic from Google.

  • Find a niche with search demand
  • Target the right keywords for your business
  • Create SEO-rich content
  • Essential link building strategies in SEO
link building scrapebox

How to Build Backlinks with Content Scraping (+ Scrapebox Tutorial)

leannewong-profile-2018

Link building is no easy feat.

In an ideal world, backlinks come to you naturally. Visitors find your blog post and love it so much, they share it on their social networks and even link to you on their website.

That’s a backlink by the way — a hyperlink from someone else’s site to your own.

But in reality, that just doesn’t happen. Or it doesn’t happen as often as you’d like.

You need to deliberately get your content seen, by putting it in front of people and get that visibility. In other words, to build backlinks, we have to do outreach.

Link-based metrics are the #1 most important factor in SEO. The more backlinks you have (aka the more websites are linking to you, the more authoritative Google thinks your site is and will rank it higher on its search results).

Examples of link building strategies: guest posting, being featured in roundup posts and broken link building.

Here’s the caveat: building backlinks takes A LOT of time:

  • Finding potential link prospects in niche relevant blogs
  • Hunting for email addresses to reach out to
  • Creating outreach emails and pitches

So today, I’m going to share a process I use for link prospecting and outreach that will save you hours of time.

Introducing Scrapebox: A Powerful Link Prospecting Tool

Once in a while you might chance upon an SEO tool that made you wonder what you did before you discovered it. Scrapebox is that tool, folks.

Scrapebox was originally a “blackhat” SEO tool designed for large-scale blog commenting or spam. What’s often missed out is the incredible scrapping features Scrapebox has that can be used for link prospecting, without spamming a site.

It’s considered the ‘Swiss Army Knife of SEO’ and should be in every marketer’s arsenal. If you don’t have a copy of Scrapebox yet, it’s only $97 for lifetime access — you need to get it right now.

When you first open Scrapebox, you might be a little confused on how to start. Don’t worry, you only need to learn a few essential steps to make the most out of it.

The first is the ‘custom footprint’.

scrapebox-urls-harvesting-seo

Click on the ‘custom footprint’ radio button on the top left box. This tells Scrapebox to search through all the websites possible on the net. #Scraping.

Next, enter some search modifiers into the box. For this example, we are going to look for all the yoga blogs that accept guest authors. To do that, I’ve entered the following queries:

  • “yoga” inurl:tag/guest
  • “yoga” intitle:”write for us”
  • “guest post” intitle:”yoga”

Finally, click on ‘start harvesting’ and within 20 seconds you’ll have a ton of possible link prospect opportunities. (I gathered 999 URLs from my queries!).

ScrapeBox can scrape search engine result pages (SERPs) at an incredibly fast rate.

Our example above was scrapping 119 URLs per second! This makes the manual process of gathering link prospects one by one a hell of a lot quicker.

Note that if you choose to do some heavy-scrapping, you’d need to get private proxies to make the most out of this tool.

Step 2: Trim the Fat and Remove Duplicates

Next we want to refine our huge list of results and remove unsuitable prospects. Scrapebox will return a lot of duplicate results and the first thing we want to do is ‘remove duplicate urls’.

scrapebox-remove-duplicates

 

Then, ‘remove duplicate domains’. This step is much more brutal and will probably trim down your scrape by 40% or more.

Finally, we want to weed out the useless sites such as free blogging platforms. You can do this via excel or use the function within Scrapebox, under ‘Remove/Filter’, choose ‘remove URLs containing’:

  • wordpress.com
  • weebly
  • blogspot
  • blogger
  • tumblr
  • squarespace

Step 3: Checking Page Authority on Scrapebox

You can bulk check link equity metrics of the URLs using Scrapebox’s ‘Page Authority’ addon.

To use this add-on, you have to sign up for a free or paid Mozscape API key.

MozRank is a score from 1–10 which measures a URL’s link popularity.

Now the free version of Mozscape has a 10 second delay so the full crawl might take 30–50 minutes depending on how many URLs you’ve harvested.

Let Scrapebox do its thing and after its completed, you’ll have the page authority metrics of all your URLs — WHEEE!

Also, you can export the data in .csv format using the ‘export results as’ button.

mozrank-scrapebox-checker

After exporting the .csv file, you want to sort your data.

  • Filter out URLs with 0 MozRank, these are likely low quality blogs

The URLs remaining would meet our minimum quality criteria — we have a total of 562 potential link prospects.

Now we just need to sift through them to find the good ones for outreach.

scrapebox-link-propsecting

Step 4: Sifting through link prospects using ScreamingFrog

After scraping hundreds of links, it’s unlikely that all of them will be relevant, legitimate link prospects. This is when ScreamingFrog SEO Spider tool comes in.

We want to look through the HTML pages we have scrapped earlier and search for these specific phrases:

  • write for us
  • contribute to our blog

To do that, we’ll set up some custom filters on ScreamingFrog.

screamingfrog-custom-search filters

Next, set up ScreamingFrog to ‘List’ mode. Navigate to ‘Mode’ -> ‘List’.

screamingfrog-list-mode-tutorial

Final step: upload the qualified URLs from our excel sheet earlier. Now sit back and let ScreamingFrog work its magic.

screamingfrog urls paste manually tutorial

After ScreamingFrog finishes processing the URLs, click on the ‘Custom’ tab to the right and you will get a list of custom filtered URLs.

The below screenshot shows 162 URLs that contain the custom filter, “write for us”. From my initial 900+URLs, I’ve filtered it down to over 162 relevant link targets — all within a few minutes.

screamingfrog guest post custom filter

So now we have a nice list of 162 links we can start reaching out to. First, we need their contact details. Our next tool for this — Buzzstream.

Buzzstream is a fantastic link building tool for streamlined outreach. There’s really no better tool out there that can build links as efficiently.

The features we’ll be using:

  • Contact details gathering
  • Outreach through Buzzstream’s email interface

Btw, you’ll need an account to do this, but you can sign up for a free account (14-day trial) which gives you full access.

buzzstream-import urls

In Buzzstream, create a new project and click on ‘add websites’ button and select ‘import from csv’ option.

Now upload the 162 URLs scrapped and filtered from ScreamingFrog into Buzzstream. Within 10–15mins, Buzzstream will have gathered a list of contact info from your website links.

This saves you days of work combing through websites for emails. (Been there, done that. Not fun).

buzzstream-contact-details

In less than 1 hour, I’ve gathered 160 niche-relevant link targets with all their contact info, including social media profiles and emails.

Now we just have to use Buzzstream’s email interface for outreach and build some quality links!

leannewong-profile-2018

Author: Leanne Wong

Leanne Wong helps bloggers and entrepreneurs grow an audience and make more money online with SEO and Pinterest strategies.

Join 2,000+ others and receive full access to:

Leanne’s library of free resources to accelerate your blog growth.

4 People reacted on this

    1. Hi Esther! I am using the paid version 🙂 The free version is good enough for lightweight link prospecting and crawling up to 500 URLs though!

  1. Right now i am able to understand what really a backlink is and how to get good backlinks for my new blog or website. Thanking you the author for sharing such originally and understanding content for backlinks. Thanks so much.

    1. Thats awesome, Dipak! I’m so glad my article has been helpful for you. Let me know if you have any questions at all 🙂

Leave a Reply:

Your email address will not be published. Required fields are marked *

shares