#8876 closed enhancement (fixed)
Disallow web robots indexing of Base Documentation Pages
Reported by: | Owned by: | ||
---|---|---|---|
Priority: | normal | Milestone: | 0.12 |
Component: | wiki system | Version: | |
Severity: | normal | Keywords: | newhelp consider |
Cc: | vslavik@… | Branch: | |
Release Notes: | |||
API Changes: | |||
Internal Changes: |
Description
When searching for information on Trac, I often end up swimming through multiple instances of the same page on different Trac installations. Example (I've seen worse, but this was the one I just hit): When doing a Google search for Trac "component remove", 7 out of the 10 links return on the first page are the TracAdmin page from different sites.
Suggestion: If a page is a base documentation page (unless here at trac.edgewall.org), send html headers to instruct searchbots to not index.
<META NAME="ROBOTS" CONTENT="NOINDEX">
How to determine if a page is a base documentation page I am not sure. One possibility:
- A page that was last modified by "trac", i.e. a page that the Trac install or upgrade made or modified.
Attachments (1)
Change History (7)
comment:1 by , 15 years ago
comment:2 by , 15 years ago
Keywords: | newhelp consider added |
---|---|
Milestone: | → next-major-0.1X |
Thanks for the suggestion.
by , 15 years ago
Attachment: | trac-no-robots-on-stock-pages.patch added |
---|
comment:3 by , 15 years ago
Cc: | added |
---|
The patch above implements this behavior, using the check for trac
user.
comment:4 by , 15 years ago
Milestone: | next-major-0.1X → 0.12 |
---|---|
Resolution: | → fixed |
Status: | new → closed |
Applied in r9137, thanks!
comment:5 by , 15 years ago
Owner: | set to |
---|
comment:6 by , 15 years ago
ingenius, that's all i have to say. i mean, this been bothering me for a looong bit.
Replying to mpotter@…:
Actually that technique of "trac" modified documents would provide the desired affect of allowing indexing here since the documents here are not modified by "trac".