Robots.txt problem/dilemma

I am trying to reconfigure a robots.txt file. I know this approach may be frowned upon, but... I want to exclude everything except certain specified directories (instead of allowing everything except certain paths/files).

Consider this block:

Code:
User-agent: *
Disallow: /
Allow: /Dir1/
Allow: /Dir2/
Allow: /Dir3/
Allow: /Dir4/
This works except for one fatal flaw: it blocks the default home page when the URL is just the domain name alone, such as:

Code:
www.domainname.com
Since the 'index.htm' or whatever default file the web server returns is implied and not explicit, the rule fails for the domain name by itself. I don't care much for the idea of allowing everything by default and then having to hunt down everything I don't want indexed/crawled. Whoever came up with this scheme must have been writing crawlers.
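
To make it concrete (assuming the crawler follows Google's documented longest-match rule, where the longest matching path wins and Allow wins ties), here is how a bare-domain request and a hypothetical page under /Dir1/ get evaluated:

Code:
/                 only "Disallow: /" matches                          -> blocked
/Dir1/page.htm    "Allow: /Dir1/" (6 chars) beats "Disallow: /" (1)   -> crawled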

I know you can Allow subdirectories after a Disallow statement, but how do you then handle anything in the root? Hell, that's the one place I want to limit. It seems like it would be much simpler to just list the areas of a site you want crawled, not the other way around. Am I crazy? Or is this just stupid?

Any workarounds I can't see?
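
The only partial fix I can think of is the $ end-of-URL wildcard that Google and Bing support (it's an extension, not part of the original robots.txt standard). Something like this should unblock the bare domain for those crawlers while still blocking everything else in the root:

Code:
User-agent: *
Disallow: /
Allow: /$
Allow: /Dir1/
Allow: /Dir2/
Allow: /Dir3/
Allow: /Dir4/
But I have no idea how crawlers that don't understand $ would treat that Allow line, so it doesn't feel like a general answer.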
