WarriorForum - Internet Marketing Forums > The Warrior Forum > Adsense / PPC / SEO Discussion Forum
03-11-2011, 03:31 PM   #1
derh
Easy Way to gather ALL permalinks (inner page links) from a site??

I have a few WordPress sites that each have over 300 permalinks (e.g. www.mysite.com/page1).

Is there an easy way to get a text or Excel list of ALL these inner page links (permalinks)?

Some sort of free software or something to gather them?


Thanks

03-11-2011, 04:10 PM   #2
Bryan V
Re: Easy Way to gather ALL permalinks (inner page links) from a site??

I would just use ScrapeBox, but that isn't free. I think SEOquake has some kind of outbound/internal link extractor. Maybe use that on Google results pages or on your sitemap.

Perhaps an attic I shall seek.
03-11-2011, 04:42 PM   #3
derh
Re: Easy Way to gather ALL permalinks (inner page links) from a site??

I have ScrapeBox.

How do I proceed in using it for that purpose?

Do I just use a footprint, e.g. mysite.com?

If I do that, I get feeds and tags, which I don't want.

03-11-2011, 04:58 PM   #4
Bryan V
Re: Easy Way to gather ALL permalinks (inner page links) from a site??

Yeah, I would just use the footprint like you said.

If there are so many that you can't delete them by hand, just use Notepad++ with regular expressions to remove them:
Replace -> Find what: ".*feed.*" Replace with: "(blank)"
Replace -> Find what: ".*tag.*" Replace with: "(blank)"
This blanks out any URL that has "feed" or "tag" in it (each pattern matches the whole line, so an empty line is left behind), or use whatever pattern you find works best. Make sure "Regular expression" is selected in the Replace dialog box instead of "Normal".
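
For anyone who would rather script the cleanup, here is a minimal Python sketch of the same idea. It is not part of the Notepad++ workflow above; the sample URLs and the function name `drop_unwanted` are made up for illustration. One difference worth noting: a bare `.*tag.*` also hits URLs that merely contain those letters (e.g. "vintage"), so this sketch assumes feeds and tags show up as whole path segments such as `/feed/` and `/tag/`.

```python
import re

# Hypothetical sample of harvested URLs; in practice you would read
# one URL per line from your exported list.
urls = [
    "http://www.mysite.com/page1",
    "http://www.mysite.com/feed/",
    "http://www.mysite.com/page1/feed/",
    "http://www.mysite.com/tag/widgets/",
    "http://www.mysite.com/vintage-lamps",
]

# Match "feed" or "tag" only as a complete path segment,
# so "vintage-lamps" is not thrown away by accident.
UNWANTED = re.compile(r"/(feed|tag)(/|$)")


def drop_unwanted(url_list):
    """Return the URLs that do not contain a /feed or /tag path segment."""
    return [u for u in url_list if not UNWANTED.search(u)]


kept = drop_unwanted(urls)
print("\n".join(kept))
```

Unlike the find-and-replace approach, this drops the unwanted lines entirely instead of leaving blank ones behind.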

Or look into a WordPress sitemap generator plugin that will list the URLs (if you're on WordPress).
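
If a plugin produces a standard XML sitemap, pulling the URLs out of it takes only a few lines. This is a rough sketch, assuming the sitemap follows the sitemaps.org format (a `<urlset>` of `<url><loc>` entries); the sample XML and the helper name `urls_from_sitemap` are invented for the example, and a real sitemap would be read from the downloaded file instead.

```python
import xml.etree.ElementTree as ET

# XML namespace used by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def urls_from_sitemap(xml_text):
    """Return every <loc> URL found in a sitemap XML document."""
    root = ET.fromstring(xml_text)
    return [
        loc.text.strip()
        for loc in root.findall(".//sm:loc", SITEMAP_NS)
        if loc.text
    ]


# Made-up sample standing in for a downloaded sitemap file.
sample = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://www.mysite.com/page1</loc></url>
  <url><loc>http://www.mysite.com/page2</loc></url>
</urlset>"""

permalinks = urls_from_sitemap(sample)
print("\n".join(permalinks))
```

The printed list can be pasted straight into a text file or a spreadsheet column.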

Not sure if there's an easier way.

03-11-2011, 07:02 PM   #5
derh
Re: Easy Way to gather ALL permalinks (inner page links) from a site??

Thanks....

03-12-2011, 06:17 AM   #6
Sitemapper
Re: Easy Way to gather ALL permalinks (inner page links) from a site??

If you don't mind using a trial (i.e. free for 30 days) for a one-off, you could try A1 Sitemap Generator.

1) Create an output filter that matches the URLs you want.
(Uncheck the big "Simplified easy mode" button to see the options.)
2) Crawl the website.
3) View the results as a list instead of a tree.
4) Export the results to CSV.

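The same filter / crawl / list / export steps can also be roughed out in plain Python. This is only a sketch of the general idea, not how A1 Sitemap Generator works internally. The names `crawl`, `to_csv`, `fake_fetch` and the in-memory `FAKE_SITE` are all hypothetical; on a real site you would swap `fake_fetch` for a function that downloads each page.

```python
import csv
import io
import re
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def crawl(start_url, fetch, keep=lambda url: True):
    """Breadth-first crawl of a single host.

    `fetch(url)` must return the page HTML as text, or None on failure.
    `keep(url)` is the output filter applied to the final list.
    """
    host = urlparse(start_url).netloc
    seen, queue = {start_url}, [start_url]
    while queue:
        url = queue.pop(0)
        html = fetch(url)
        if html is None:
            continue
        parser = LinkParser()
        parser.feed(html)
        for href in parser.hrefs:
            # Resolve relative links and drop "#fragment" parts.
            link = urldefrag(urljoin(url, href)).url
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append(link)
    return sorted(u for u in seen if keep(u))


def to_csv(urls):
    """Return the URL list as CSV text with a single 'url' column."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(["url"])
    writer.writerows([u] for u in urls)
    return buf.getvalue()


# Made-up pages standing in for a real site, so the sketch runs offline.
FAKE_SITE = {
    "http://www.mysite.com/": '<a href="/page1">1</a> <a href="/feed/">feed</a> '
                              '<a href="http://other.example/">elsewhere</a>',
    "http://www.mysite.com/page1": '<a href="/page2#top">2</a> <a href="/">home</a>',
    "http://www.mysite.com/page2": "",
    "http://www.mysite.com/feed/": "",
}


def fake_fetch(url):
    return FAKE_SITE.get(url)


# Output filter: skip feed and tag URLs.
def no_feeds_or_tags(url):
    return not re.search(r"/(feed|tag)(/|$)", url)


pages = crawl("http://www.mysite.com/", fake_fetch, keep=no_feeds_or_tags)
print(to_csv(pages))
```

Passing the fetch function in as a parameter keeps the crawl logic separate from the network code, which makes it easy to test on a handful of fake pages before pointing it at a live site.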
All times are GMT -6.