#8876 closed enhancement (fixed)
Disallow web robots indexing of Base Documentation Pages
| Reported by: | Owned by: | ||
|---|---|---|---|
| Priority: | normal | Milestone: | 0.12 |
| Component: | wiki system | Version: | |
| Severity: | normal | Keywords: | newhelp consider |
| Cc: | vslavik@… | Branch: | |
| Release Notes: | |||
| API Changes: | |||
| Internal Changes: | |||
Description
When searching for information on Trac, I often end up swimming through multiple instances of the same page on different Trac installations. Example (I've seen worse, but this was the one I just hit): When doing a Google search for Trac "component remove", 7 out of the 10 links return on the first page are the TracAdmin page from different sites.
Suggestion: If a page is a base documentation page (unless here at trac.edgewall.org), send html headers to instruct searchbots to not index.
<META NAME="ROBOTS" CONTENT="NOINDEX">
How to determine if a page is a base documentation page I am not sure. One possibility:
- A page that was last modified by "trac", i.e. a page that the Trac install or upgrade made or modified.
Attachments (1)
Change History (7)
comment:1 by , 16 years ago
comment:2 by , 16 years ago
| Keywords: | newhelp consider added |
|---|---|
| Milestone: | → next-major-0.1X |
Thanks for the suggestion.
by , 16 years ago
| Attachment: | trac-no-robots-on-stock-pages.patch added |
|---|
comment:3 by , 16 years ago
| Cc: | added |
|---|
The patch above implements this behavior, using the check for trac user.
comment:4 by , 16 years ago
| Milestone: | next-major-0.1X → 0.12 |
|---|---|
| Resolution: | → fixed |
| Status: | new → closed |
Applied in r9137, thanks!
comment:5 by , 16 years ago
| Owner: | set to |
|---|
comment:6 by , 16 years ago
ingenius, that's all i have to say. i mean, this been bothering me for a looong bit.



Replying to mpotter@…:
Actually that technique of "trac" modified documents would provide the desired affect of allowing indexing here since the documents here are not modified by "trac".