Need help on Scraping

by TravisO 19 replies
Hi Warriors,

I am having a bad moment now. I do not know how to scrape. I use automation while I am backlinking(ultimate demon) .

Anyone can help me how to scrape manually?
I need many websites to post. Please!
Anyone can help me?

If possible, a step by step. Please please!


Thanks,
Travis
#search engine optimization #scraping
  • Profile picture of the author lotsofsnow
    Go to homedepot I buy a scraper. They cost about $5 in a3-pack.

    LOL
    {{ DiscussionBoard.errors[8416504].message }}
    • Profile picture of the author TravisO
      Originally Posted by hpgoodboy View Post

      Go to homedepot I buy a scraper. They cost about $5 in a3-pack.

      LOL
      I just want to have an idea to at least manually scrape things up.
      {{ DiscussionBoard.errors[8416517].message }}
      • Profile picture of the author godoveryou
        Why manually scrape things?

        If you're automating the posting and looking to use questionable content anyways, why not go with a full on content generator? Content Foundry will be releasing human readable content in the near future. (Source: BIG UPDATE: Human Readable Content On Tap? + PDF Scrapping)

        ... What? I couldn't help it, I had to reference myself. :p

        Anyways... the point is that if you have made up your mind to go down this road, then commit.

        The reason I say that is pretty simple.

        If you half-do anything than you aren't going to see much success. It's like people that buy xrumer but are too cheap to get a bulletproof server - then complain that they can't get anywhere with it.

        Mass link building comes down to one thing - volume. You need to be able to hammer out a lot of links in varying methods if you are going to automate your link building operation.

        But Wait, I Didn't Say Anything About MASS Link Building...

        Well, once you are using questionable content, mass link building is the next logical step.

        My point is that I don't understand why you are doing this by hand when you are automating the rest.
        Signature
        Don't Know Me? - Read my interview at Matthewwoodward.co.uk
        http://www.godoveryou.com/
        {{ DiscussionBoard.errors[8416561].message }}
  • Profile picture of the author Cobaki
    Travis, what are you trying to scrape? I normally outsource that kind of work.
    {{ DiscussionBoard.errors[8416564].message }}
    • Profile picture of the author TravisO
      Originally Posted by Cobaki View Post

      Travis, what are you trying to scrape? I normally outsource that kind of work.
      I actually have this Ultimate Demon with me. It has a scraping feature. I have tried this codes to scrape things "submit articles", "submission guidelines" "top articles"

      I do not understand why I must enter this "submit articles", "submission guidelines" "top articles.

      I have tried that and I can't scrape anymore. My IP has been banned. Is it possible to refresh all of my IPs and scrape again.

      I just want to understand this codes and have my own "submit articles", "submission guidelines" "top articles".
      {{ DiscussionBoard.errors[8416586].message }}
  • Profile picture of the author andishm
    Originally Posted by TravisO View Post

    Hi Warriors,

    I am having a bad moment now. I do not know how to scrape. I use automation while I am backlinking(ultimate demon) .

    Anyone can help me how to scrape manually?
    I need many websites to post. Please!
    Anyone can help me?

    If possible, a step by step. Please please!


    Thanks,
    Travis
    You can try scrapebox + footprints of UD supported sites to get good list of your own custom sites.
    Signature

    I love SEO... I love ranking my websites on TOP.

    {{ DiscussionBoard.errors[8418878].message }}
  • Profile picture of the author JSProjects
    What GOY said. If you're diving in, you may as well go ALL in.

    Anyways, for a scraper, the most widely used is Scrapebox. Gscraper is another alternative.
    {{ DiscussionBoard.errors[8418904].message }}
    • Profile picture of the author UnkwnUsr
      Originally Posted by JSProjects View Post

      What GOY said. If you're diving in, you may as well go ALL in.

      Anyways, for a scraper, the most widely used is Scrapebox. Gscraper is another alternative.
      Don't you pretty much need to use a proxy for these programs if you don't want to get banned?
      {{ DiscussionBoard.errors[8419039].message }}
      • Profile picture of the author godoveryou
        Originally Posted by UnkwnUsr View Post

        Don't you pretty much need to use a proxy for these programs if you don't want to get banned?
        Proxies are cheap, but that aside I've had Content Foundry scraping over 500 article per minute without them.

        A big portion of a content generator's success is where it is getting it's content from. CF has one of the best set of resources that I know of.
        Signature
        Don't Know Me? - Read my interview at Matthewwoodward.co.uk
        http://www.godoveryou.com/
        {{ DiscussionBoard.errors[8419267].message }}
      • Profile picture of the author Kevin Maguire
        Originally Posted by UnkwnUsr View Post

        Don't you pretty much need to use a proxy for these programs if you don't want to get banned?
        GScraper has 56 x 64Gig - 1000MB - Dedi Servers port scanning for open proxies 24/7/365.

        So you could say, proxies are not an issue for users.

        Scrapebox is more of a toy for kids when it comes to scraping.



        (Full Disclosure)
        GScraper is another pie I have my finger in.

        Footprints for Article sites


        Code:
        “index.php?page=submitarticle”
        “Articles with any spelling or grammar errors will be deleted”
        “upload your articles and keep updated about new articles.”
        “If you have hired a ghost writer, you agree that you have”
        “Publish your article in RSS format for other websites to syndicate”
        “Do not submit articles filled with spelling errors and bad grammar”
        “Using Article Directory plugin”
        “There are * published articles and * registered authors”
        “RSS Articles” “RSS comments” “Recent Articles”
        “Powered by ArticleMS”
        “You do not have permission to comment. If you log in, you may be able to comment.”
        “By publishing information packed articles, you’ll soon enjoy”
        “Use the articles in our directory on your website to provide your visitors”
        “Member Login to Submit Article”
        “login2submitart.php”
        “submitarticles.php”
        “Powered By : Article Friendly”
        “submitart.php”
        ArticleMS + inurl:/articles/
        ArticleMS + inurl:/art/
        ArticleMS + inurl:/category/
        ArticleMS + inurl:/articlems/
        ArticleMS + inurl:/artms/
        Article Dashboard + inurl:/article/
        Article Dashboard + inurlopulararticles.php
        Article Dashboard + inurl:/profile/
        Article Dashboard + inurl:/login2submitart2.php
        Article Dashboard + inurl:/submitarticles.php
        Powered by: php Link Directory
        powered by PHPLD
        Powered by WSN Links
        powered by PHP Weby
        Powered by cpLinks
        Powered by cpDynaLinks
        powered by BosDirectory
        Powered by Link manager LinkMan
        Powered by Gossamer Links
        Powered by K-Links
        Powered by In-Link
        Powered by eSyndiCat Directory Software
        Powered by: qlWebDS Pro
        Powered by Directory software by LBS
        powered by phpMyDirectory.com
        Powered by HubDir PHP directory script  
        Powered by free article directory
        Powered by Article Dashboard
        Powered by ArticleMS
        powered by article dashboard
        Powered by: php Link Directory
        powered by PHPLD
        Powered by WSN Links
        powered by PHP Weby
        Powered by cpLinks
        Powered by cpDynaLinks
        powered by BosDirectory
        Powered by Link manager LinkMan
        Powered by Gossamer Links
        Powered by K-Links
        Powered by In-Link
        Powered by eSyndiCat Directory Software
        Powered by: qlWebDS Pro
        Powered by Directory software by LBS
        powered by phpMyDirectory.com
        Powered by HubDir PHP directory script
        Powered by Article Dashboard "Sign Up for a free account" -forum
        Powered by ArticleMS "Submit Article" "Latest Articles"
        Powered by Article Directory
        Powered By: Article Friendly
        Powered By Article directory software
        powered by NextAge Tech
        Powered by Revenue Sharing Article Directory Script
        powered by articlems
        Powered by Article Directory plugin
        Powered by Free Directory Submissions
        Powered by the original Free Article Directory
        Powered by Article Directory %KW%
        Powered By: Article Friendly %KW%
        Powered By Article directory software %KW%
        powered by NextAge Tech %KW%
        Powered by Revenue Sharing Article Directory Script %KW%
        powered by article dashboard %KW%
        powered by articlems %KW%
        Powered by Article Directory plugin %KW%
        “Powered by the original Free Article Directory”
        “Powered by Article Friendly”
        “Powered By : Article Friendly”
        “Powered by Article Dashboard”
        “Powered by ArticleMS”
        “Powered by WordPress · Using Article Directory plugin · Theme by Dimox”
        Submit Articles "Total Articles" "Total Authors" "Total Downloads"
        Submit Articles
        Submit Article
        submit an article
        Submit Free Articles
        Submit New Articles for your articles marketing
        inurl:/articles/
        inurl:/art/
        inurl:/category/
        inurl:/articlems/
        inurl:/artms/
        inurl:logintosubmitart.php
        inurl:/article/
        inurl: populararticles.php - article dashboard
        inurl:/profile/ - article dashboard -- profile page -- great!!!
        inurl:/login2submitart2.php - article dashboard
        inurl:/submitarticles.php - article dashboard
        intext:”Confirmation request email was sent to your email address”
        “index.php?page=submitarticle”
        “Articles with any spelling or grammar errors will be deleted”
        “upload your articles and keep updated about new articles.”
        “If you have hired a ghost writer, you agree that you have”
        “Publish your article in RSS format for other websites to syndicate”
        “Do not submit articles filled with spelling errors and bad grammar”
        “Using Article Directory plugin”
        “There are * published articles and * registered authors”
        “RSS Articles” “RSS comments” “Recent Articles”
        “RSS Articles” “RSS comments” “Recent Articles” “Authorization” “Username:” “Password:” “Remember Me” “Register | Lost your password?”
        “You do not have permission to comment. If you log in, you may be able to comment.”
        “By publishing information packed articles, you’ll soon enjoy”
        “Use the articles in our directory on your website to provide your visitors”
        “Member Login to Submit Article”
        “login2submitart.php”
        “submitarticles.php”
        “submitart.php”
        Article Directory Powered by WordPress %KW%
        registered authors in our article directory
        Additional Articles From *
        Welcome to article directory *. Here you can find interesting and useful information on most popular themes.
        There are * published articles and * registered authors in our article directory.
        This author has published * articles so far. More info about the author is coming soon.
        There are now * Excellent Articles in our Database from * Authors
        “Use or our service is protected by our Privacy Policy"
        Should be enough to get you started.
        {{ DiscussionBoard.errors[8419675].message }}
        • Profile picture of the author TravisO
          Originally Posted by Kevin Maguire View Post

          GScraper has 56 x 64Gig - 1000MB - Dedi Servers port scanning for open proxies 24/7/365.

          So you could say, proxies are not an issue for users.

          Scrapebox is more of a toy for kids when it comes to scraping.



          (Full Disclosure)
          GScraper is another pie I have my finger in.

          Footprints for Article sites


          Code:
          “index.php?page=submitarticle”
          “Articles with any spelling or grammar errors will be deleted”
          “upload your articles and keep updated about new articles.”
          “If you have hired a ghost writer, you agree that you have”
          “Publish your article in RSS format for other websites to syndicate”
          “Do not submit articles filled with spelling errors and bad grammar”
          “Using Article Directory plugin”
          “There are * published articles and * registered authors”
          “RSS Articles” “RSS comments” “Recent Articles”
          “Powered by ArticleMS”
          “You do not have permission to comment. If you log in, you may be able to comment.”
          “By publishing information packed articles, you’ll soon enjoy”
          “Use the articles in our directory on your website to provide your visitors”
          “Member Login to Submit Article”
          “login2submitart.php”
          “submitarticles.php”
          “Powered By : Article Friendly”
          “submitart.php”
          ArticleMS + inurl:/articles/
          ArticleMS + inurl:/art/
          ArticleMS + inurl:/category/
          ArticleMS + inurl:/articlems/
          ArticleMS + inurl:/artms/
          Article Dashboard + inurl:/article/
          Article Dashboard + inurlopulararticles.php
          Article Dashboard + inurl:/profile/
          Article Dashboard + inurl:/login2submitart2.php
          Article Dashboard + inurl:/submitarticles.php
          Powered by: php Link Directory
          powered by PHPLD
          Powered by WSN Links
          powered by PHP Weby
          Powered by cpLinks
          Powered by cpDynaLinks
          powered by BosDirectory
          Powered by Link manager LinkMan
          Powered by Gossamer Links
          Powered by K-Links
          Powered by In-Link
          Powered by eSyndiCat Directory Software
          Powered by: qlWebDS Pro
          Powered by Directory software by LBS
          powered by phpMyDirectory.com
          Powered by HubDir PHP directory script  
          Powered by free article directory
          Powered by Article Dashboard
          Powered by ArticleMS
          powered by article dashboard
          Powered by: php Link Directory
          powered by PHPLD
          Powered by WSN Links
          powered by PHP Weby
          Powered by cpLinks
          Powered by cpDynaLinks
          powered by BosDirectory
          Powered by Link manager LinkMan
          Powered by Gossamer Links
          Powered by K-Links
          Powered by In-Link
          Powered by eSyndiCat Directory Software
          Powered by: qlWebDS Pro
          Powered by Directory software by LBS
          powered by phpMyDirectory.com
          Powered by HubDir PHP directory script
          Powered by Article Dashboard "Sign Up for a free account" -forum
          Powered by ArticleMS "Submit Article" "Latest Articles"
          Powered by Article Directory
          Powered By: Article Friendly
          Powered By Article directory software
          powered by NextAge Tech
          Powered by Revenue Sharing Article Directory Script
          powered by articlems
          Powered by Article Directory plugin
          Powered by Free Directory Submissions
          Powered by the original Free Article Directory
          Powered by Article Directory %KW%
          Powered By: Article Friendly %KW%
          Powered By Article directory software %KW%
          powered by NextAge Tech %KW%
          Powered by Revenue Sharing Article Directory Script %KW%
          powered by article dashboard %KW%
          powered by articlems %KW%
          Powered by Article Directory plugin %KW%
          “Powered by the original Free Article Directory”
          “Powered by Article Friendly”
          “Powered By : Article Friendly”
          “Powered by Article Dashboard”
          “Powered by ArticleMS”
          “Powered by WordPress · Using Article Directory plugin · Theme by Dimox”
          Submit Articles "Total Articles" "Total Authors" "Total Downloads"
          Submit Articles
          Submit Article
          submit an article
          Submit Free Articles
          Submit New Articles for your articles marketing
          inurl:/articles/
          inurl:/art/
          inurl:/category/
          inurl:/articlems/
          inurl:/artms/
          inurl:logintosubmitart.php
          inurl:/article/
          inurl: populararticles.php - article dashboard
          inurl:/profile/ - article dashboard -- profile page -- great!!!
          inurl:/login2submitart2.php - article dashboard
          inurl:/submitarticles.php - article dashboard
          intext:”Confirmation request email was sent to your email address”
          “index.php?page=submitarticle”
          “Articles with any spelling or grammar errors will be deleted”
          “upload your articles and keep updated about new articles.”
          “If you have hired a ghost writer, you agree that you have”
          “Publish your article in RSS format for other websites to syndicate”
          “Do not submit articles filled with spelling errors and bad grammar”
          “Using Article Directory plugin”
          “There are * published articles and * registered authors”
          “RSS Articles” “RSS comments” “Recent Articles”
          “RSS Articles” “RSS comments” “Recent Articles” “Authorization” “Username:” “Password:” “Remember Me” “Register | Lost your password?”
          “You do not have permission to comment. If you log in, you may be able to comment.”
          “By publishing information packed articles, you’ll soon enjoy”
          “Use the articles in our directory on your website to provide your visitors”
          “Member Login to Submit Article”
          “login2submitart.php”
          “submitarticles.php”
          “submitart.php”
          Article Directory Powered by WordPress %KW%
          registered authors in our article directory
          Additional Articles From *
          Welcome to article directory *. Here you can find interesting and useful information on most popular themes.
          There are * published articles and * registered authors in our article directory.
          This author has published * articles so far. More info about the author is coming soon.
          There are now * Excellent Articles in our Database from * Authors
          “Use or our service is protected by our Privacy Policy"
          Should be enough to get you started.
          I supposed to copy this for every line to scraper right?
          {{ DiscussionBoard.errors[8420125].message }}
        • Profile picture of the author godoveryou
          Originally Posted by Kevin Maguire View Post

          GScraper is another pie I have my finger in
          You know, I use a lot of scraping tools - far beyond your average 'clicky button' windows apps.

          I tried talking to the guy for that... Even gave him access to a few of my private proxies and still no private proxy support.

          While private proxies may not be a big deal to someone that can only afford a few hundred of them, I literally have thousands of them at my disposal..( as shown in youtube videos)..

          If you have your finger in it, fix that hole (unless its been fixed already.) It's bad enough to be a windows application, but not supporting private proxies for as long as it did really makes it a joke.
          Signature
          Don't Know Me? - Read my interview at Matthewwoodward.co.uk
          http://www.godoveryou.com/
          {{ DiscussionBoard.errors[8420562].message }}
  • Profile picture of the author seoace
    Well, just use GSA SER. It helps you scrape for link targets automatically so you don't have to do it.
    Signature
    Who else needs a SEO Client Dashboard for their SEO services ?
    Let your clients monitor their SEO campaigns (Rankings, Backlinks and Work Done)
    {{ DiscussionBoard.errors[8419029].message }}
  • Profile picture of the author chawk
    Learn how to program, start with python. There are tutorials on the web that show how to scrape.
    {{ DiscussionBoard.errors[8419787].message }}
  • Profile picture of the author r0dvan
    I can help you with custom coding whatever you need.
    But actually if you are using automatic backlink creation, then you are not doing a good job. That will get you down instead of up.
    Signature
    WIZARD OF BOTS: Custom bots, scrapers, classes.
    MARKETING MALL: Freebies (Graphics, software, tips)
    LINKEDIN MARKETER PRO: Best GreyHat Linkedin Automation Tool
    {{ DiscussionBoard.errors[8419882].message }}
  • Profile picture of the author jasonsluck
    I have a friend that does scraping, its all custom automation. If you can dream it, he can automate it. His rates start at 5k, PM me if interested.
    {{ DiscussionBoard.errors[8432365].message }}
    • Profile picture of the author dvduval
      Originally Posted by jasonsluck View Post

      I have a friend that does scraping, its all custom automation. If you can dream it, he can automate it. His rates start at 5k, PM me if interested.
      A decent windows programmer with scrapping experience could build something specific for a few hundred dollars. As it gets into many different types of content, of course that drives up the cost.
      Signature
      It is okay to contact me! I have been developing software since 1999, creating many popular products like phpLD.
      {{ DiscussionBoard.errors[8549385].message }}
      • Profile picture of the author jfoppoli
        Why would anyone want to invest 5k or a few hundred dollars for scraping if you have scraping software to do it for around $60-$70?
        Signature
        Julio Foppoli
        Author of Jump Start Your Spanish
        The Most Radically Effective Starter Spanish Program EVER Developed!
        {{ DiscussionBoard.errors[8639549].message }}
        • Profile picture of the author JSProjects
          Originally Posted by jfoppoli View Post

          Why would anyone want to invest 5k or a few hundred dollars for scraping if you have scraping software to do it for around $60-$70?
          I was thinking the same.

          Scrapebox has never once let me down. And if you know the right footprints, you can scrape mass quantities of ANYTHING.
          {{ DiscussionBoard.errors[8659177].message }}

Trending Topics