Edgewall Software
Modify

Opened 15 years ago

Closed 15 years ago

Last modified 15 years ago

#8876 closed enhancement (fixed)

Disallow web robots indexing of Base Documentation Pages

Reported by: mpotter@… Owned by: Vaclav Slavik <vslavik@…>
Priority: normal Milestone: 0.12
Component: wiki system Version:
Severity: normal Keywords: newhelp consider
Cc: vslavik@… Branch:
Release Notes:
API Changes:
Internal Changes:

Description

When searching for information on Trac, I often end up swimming through multiple instances of the same page on different Trac installations. Example (I've seen worse, but this was the one I just hit): When doing a Google search for Trac "component remove", 7 out of the 10 links return on the first page are the TracAdmin page from different sites.

Suggestion: If a page is a base documentation page (unless here at trac.edgewall.org), send html headers to instruct searchbots to not index.

<META NAME="ROBOTS" CONTENT="NOINDEX">

How to determine if a page is a base documentation page I am not sure. One possibility:

  • A page that was last modified by "trac", i.e. a page that the Trac install or upgrade made or modified.

Attachments (1)

trac-no-robots-on-stock-pages.patch (657 bytes ) - added by Vaclav Slavik <vslavik@…> 15 years ago.

Download all attachments as: .zip

Change History (7)

in reply to:  description comment:1 by mpotter@…, 15 years ago

Replying to mpotter@…:

Suggestion: If a page is a base documentation page (unless here at trac.edgewall.org), send html headers to instruct searchbots to not index.
[…]
One possibility:

  • A page that was last modified by "trac", i.e. a page that the Trac install or upgrade made or modified.

Actually that technique of "trac" modified documents would provide the desired affect of allowing indexing here since the documents here are not modified by "trac".

comment:2 by Christian Boos, 15 years ago

Keywords: newhelp consider added
Milestone: next-major-0.1X

Thanks for the suggestion.

by Vaclav Slavik <vslavik@…>, 15 years ago

comment:3 by Vaclav Slavik <vslavik@…>, 15 years ago

Cc: vslavik@… added

The patch above implements this behavior, using the check for trac user.

comment:4 by Christian Boos, 15 years ago

Milestone: next-major-0.1X0.12
Resolution: fixed
Status: newclosed

Applied in r9137, thanks!

comment:5 by Christian Boos, 15 years ago

Owner: set to Vaclav Slavik <vslavik@…>

comment:6 by lkraav <leho@…>, 15 years ago

ingenius, that's all i have to say. i mean, this been bothering me for a looong bit.

Modify Ticket

Change Properties
Set your email in Preferences
Action
as closed The owner will remain Vaclav Slavik <vslavik@…>.
The resolution will be deleted. Next status will be 'reopened'.
to The owner will be changed from Vaclav Slavik <vslavik@…> to the specified user.

Add Comment


E-mail address and name can be saved in the Preferences .
 
Note: See TracTickets for help on using tickets.