Extract post titles from a WordPress blog (not mine)

by sbac92
5 replies
Hi,
What is the best way to extract the titles of all the posts on a WordPress blog? To be clear, the blog is not mine; I only have the URL.
I've tried this Yahoo Pipes trick, but it doesn't seem to work with WordPress:

hxxp://www.bloggersentral.com/2009/12/list-post-titles-in-alphabetical-order.html?commentPage=2

Any help would be much appreciated!
Kind regards,

Kate
#blog #extract #mine #post #titles #wordpress
  • Profile picture of the author Microsys
    Do you mean extract URLs + titles of all posts in a blog? Any scraper/crawler tool will do. In A1 Website Analyzer it would be something like this:

    a) Scan the website with the WordPress scan preset (to exclude e.g. category pages).

    b) When the scan has finished, set the visible columns to e.g. URL + title.

    c) Export to CSV.


    But it all depends on what you need it for. If you already know a crawler/scraper tool, there's a good chance you can do what you want if you configure it properly.
  • Profile picture of the author squadron
    Originally Posted by sbac92

    Hi,
    I'd like to know what is the best method to extract the titles of all the posts on a wordpress blog? ...

    Kate
    If they have a sitemap, you should be able to copy and paste a list of links from it. To check, go to http://theirdomainname.com/sitemap.xml and see whether it loads.

    A lot of blogs have a Google sitemap plugin installed.
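
The sitemap check above could also be scripted. Here is a minimal Python sketch, assuming the blog serves a standard sitemaps.org file at the address mentioned; the function names `urls_from_sitemap` and `fetch_sitemap` are made up for illustration. Note that it only lists the links, not the titles, and if the file is a sitemap index the links will point to further sitemaps.

```python
import urllib.request
import xml.etree.ElementTree as ET

# XML namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def urls_from_sitemap(xml_data):
    """Return the text of every <loc> element in a sitemap (str or bytes)."""
    root = ET.fromstring(xml_data)
    return [
        loc.text.strip()
        for loc in root.iterfind(".//sm:loc", SITEMAP_NS)
        if loc.text
    ]


def fetch_sitemap(url):
    """Download the raw sitemap bytes (hypothetical helper, not called here)."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return response.read()
```

Something like `urls_from_sitemap(fetch_sitemap("http://theirdomainname.com/sitemap.xml"))` would then give a plain Python list of links, provided the blog really exposes a sitemap at that path.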
  • Profile picture of the author Kevin Maguire
    GScraper can do this.
    • Profile picture of the author adjmpw
      First get the page content, then use a regex to pull out the titles. I can do that easily in PHP; PM me.
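
For anyone who wants to try the "fetch the page, then regex" idea themselves, here is a rough sketch in Python rather than PHP. It assumes the theme wraps each post title in a heading with the class `entry-title`, which is common in WordPress themes but not guaranteed; check the page source and adjust the pattern if needed. The names `extract_titles` and `fetch_page` are made up for illustration.

```python
import html
import re
import urllib.request

# Assumption: post titles sit in <h1>..<h6> headings carrying class "entry-title".
TITLE_RE = re.compile(
    r'<h[1-6][^>]*class="[^"]*\bentry-title\b[^"]*"[^>]*>(.*?)</h[1-6]>',
    re.IGNORECASE | re.DOTALL,
)
TAG_RE = re.compile(r"<[^>]+>")  # any remaining inner tag, e.g. the <a> link


def extract_titles(page_html):
    """Return the plain-text titles found in one page of HTML."""
    titles = []
    for inner in TITLE_RE.findall(page_html):
        text = html.unescape(TAG_RE.sub("", inner))  # drop tags, decode &amp; etc.
        titles.append(" ".join(text.split()))        # collapse whitespace
    return titles


def fetch_page(url):
    """Download one page as text (hypothetical helper, not called here)."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")
```

A regex is fragile against unusual markup, so treat this as a quick-and-dirty approach; it also only covers the page you feed it, so archive or paginated pages would each need fetching in turn.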
      • Profile picture of the author sbac92
        Hi,
        A big thanks to all of you for your great and quick support!
        I'll use A1 Website Analyzer; I just tried it and it seems to work!
        When exporting to CSV, do you know how to keep the hyperlinks in the URL field? In my first test, the URLs were treated as plain text rather than clickable links.

        Many thanks!
        Regards,