8 replies
Ages ago I traffic geyser could scrape YouTube videos, peel off the hologram, ads, and everything and play it on my site.

I'm looking to do something similar. I need to pull thousands of videos into a new video sharing site. We'll be adding subtitles to all the videos. We have a player to do that. BUT, where do I either: A) get the base original content from in the first place; or B) scrape enough videos into the site to give it a robust launch until the users populate the videos?

Suggestions? Thanks for your help!

Adryenn
#scraping #video scraping #youtube
  • Profile picture of the author kkoechel
    Originally Posted by Adryenn View Post

    Ages ago I traffic geyser could scrape YouTube videos, peel off the hologram, ads, and everything and play it on my site.

    I'm looking to do something similar. I need to pull thousands of videos into a new video sharing site. We'll be adding subtitles to all the videos. We have a player to do that. BUT, where do I either: A) get the base original content from in the first place; or B) scrape enough videos into the site to give it a robust launch until the users populate the videos?

    Suggestions? Thanks for your help!

    Adryenn
    youtube-dl available here: packages.debian.org/sid/youtube-dl

    you really need a script to combine with this. How many thousands and do you have an ftp server to drop them on?
    {{ DiscussionBoard.errors[2865153].message }}
  • If you upload content for online storage and never keep an archival copy, you need to rethink your workflow. "It absolutely amazes me that people will put their important, priceless data on a third party server, and not even go to the trouble of making backups, saving it somewhere." If you don't archive locally, you will have problems. If someone else is planning to harvest your stuff dynamically, they should know how to scrape your stuff with Firefox. Here are Open Source YouTube scraping tips worth trying before YouTube deletes your videos.
    {{ DiscussionBoard.errors[2884504].message }}
  • Profile picture of the author LosingMyHair
    I use get-flash-videos and Amazon S3. You can write a little script to download fetch any YouTube video by supplying a videoID (in the YouTube URL) or fetch any video from several supported web sites by doing a keyword search. Then have the script upload the files to an S3 account (or just keep them on your server).

    And yeah, it strips the YouTube watermark too.
    {{ DiscussionBoard.errors[2886474].message }}
    • Profile picture of the author mojojuju
      Originally Posted by LosingMyHair View Post

      I use get-flash-videos and Amazon S3.
      Hey, that's a cool script. I just used it to download my son's sleeping medication which is this:


      It works like a charm.
      Signature

      :)

      {{ DiscussionBoard.errors[2886796].message }}
      • Profile picture of the author Adryenn
        Thanks! I'll look into those and see what works. I was looking for a bulk scraping, to download all videos related to a specific topic.
        Signature

        Adryenn Ashley - , Wow! Is Me, http://www.wowisme.net

        {{ DiscussionBoard.errors[2972348].message }}
        • Profile picture of the author nmarley
          Originally Posted by Adryenn View Post

          I was looking for a bulk scraping, to download all videos related to a specific topic.
          I use clive for scraping a single video. For bulk, I would wrap a script around it. And to find all videos related to a specific topic, that would be a different script/process. At least, that's how I would go about it.
          {{ DiscussionBoard.errors[3013756].message }}
          • Profile picture of the author SaSh
            Great tips in this thread! I'll check out that Clive script for sure.
            {{ DiscussionBoard.errors[3014001].message }}
  • Profile picture of the author kkoechel
    I wrote this for single uses: www.mechanicalreview.com
    {{ DiscussionBoard.errors[2990997].message }}

Trending Topics