Desperate newbie seeks help... Please?

6 replies
Hey guys!

I have to extract more than 2,500 URLs for a client.

The file is an OpenOffice .odt file, and the links look like this.


I also have the PDF version of the file, and the HTML version.

Do you guys know how to extract all those links using free software?

Thank you very much!
  • marketingwithjosh
    Find and replace all the "numbers" and everything else until you have nothing but the links standing alone, then copy them all at once? I don't know of any software that will extract them for you. What's the required output?

    Joshua
  • SeanLee
    Hey Joshua!

    How can I replace all the numbers in bulk, when every one of them is unique (going from 1 to 2532 :S)?

    I want the URLs alone, without anything else.
  • bogdan247
    With Notepad++ (I am using 5.82):

    1. Copy all the text into a text file.
    2. Open it with Notepad++.
    3. Select all the text.
    4. Go to TextFX > TextFX Tools > Delete Line Numbers or First Word.

    Hope it helps.
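
For anyone without the TextFX plugin, the same idea can be sketched in a few lines of Python. This is only a rough equivalent, not what Notepad++ does internally, and it assumes every line looks like "number, whitespace, URL". The function name `drop_first_word` and the sample URLs are made up for illustration.

```python
def drop_first_word(text: str) -> str:
    """Remove the first whitespace-delimited token from every line.

    Assumption: each useful line is "<number> <url>". Lines that hold
    fewer than two tokens (blank lines, stray numbers) are skipped.
    """
    cleaned = []
    for line in text.splitlines():
        parts = line.strip().split(maxsplit=1)
        if len(parts) == 2:
            cleaned.append(parts[1])
    return "\n".join(cleaned)


# Hypothetical sample in the assumed "number URL" layout.
sample = "1 http://example.com/a\n2 http://example.com/b2\n"
print(drop_first_word(sample))
```

To use it on the real list, paste the copied text into a file, read it with `open(...).read()`, and write the result back out.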
  • SeanLee
    Thanks a million, mate!

    You have solved my problem.

    Bye!
  • trytolearnmore
    Wouldn't replacing all the numbers also remove them from the URLs?
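
A blanket "replace every digit" would indeed damage the URLs. One way around it, sketched below, is a pattern anchored to the start of the line, so only the leading number is removed. The pattern and the name `strip_leading_number` are illustrative assumptions; it presumes the number comes first on the line, optionally followed by "." or ")", and then whitespace.

```python
import re

# Anchored at the start of the line: optional spaces, digits, an optional
# "." or ")", then at least one whitespace character.
LEADING_NUMBER = re.compile(r"^\s*\d+[.)]?\s+")


def strip_leading_number(line: str) -> str:
    """Remove only the number at the start of the line, if there is one."""
    return LEADING_NUMBER.sub("", line, count=1)


# Digits inside the URL are left alone.
print(strip_leading_number("12 http://example.com/page42?id=7"))
# prints: http://example.com/page42?id=7
```

A line that does not start with a number passes through unchanged.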
  • Robert H Cwik
    I would suggest a four- (or five-) step solution.

    1. Optional: Open the document in OO and save it as a Doc if you have MS Office.
    ----
    2. Open the document and replace all spaces with tabs (also replace any soft line breaks (^l) with hard line breaks (^p)).
    3. Convert the text into a table (using tab as the column separator and hard line break as the row separator).
    4. Select and delete columns not containing the URLs
    5. Convert the table back to text.

    This is how I do it. Hope that helps.
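
The table steps above amount to splitting each row into columns and keeping only the URL column. A minimal Python sketch of that idea follows. It is not the word-processor workflow itself, and it assumes a "URL cell" is any field starting with http://, https:// or www.; the name `keep_url_column` and the sample rows are hypothetical.

```python
from typing import List

# Assumed prefixes that mark a cell as a URL.
URL_PREFIXES = ("http://", "https://", "www.")


def keep_url_column(text: str) -> List[str]:
    """Split each row on whitespace (spaces or tabs) and keep URL-like cells."""
    urls = []
    for row in text.splitlines():
        cells = row.split()
        urls.extend(cell for cell in cells if cell.startswith(URL_PREFIXES))
    return urls


# Hypothetical rows: a number column and a URL column separated by tabs.
sample = "1\thttp://example.com/a\n2\thttps://example.org/b?id=3"
print(keep_url_column(sample))
```

Because the filter looks at the cell content rather than the column position, it still works if some rows have extra columns.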
