#8876 closed enhancement (fixed)

Disallow web robots indexing of Base Documentation Pages

Reported by:	mpotter@…	Owned by:	Vaclav Slavik <vslavik@…>
Priority:	normal	Milestone:	0.12
Component:	wiki system	Version:
Severity:	normal	Keywords:	newhelp consider
Cc:	vslavik@…	Branch:
Release Notes:
API Changes:
Internal Changes:

Description

When searching for information on Trac, I often end up swimming through multiple instances of the same page on different Trac installations. Example (I've seen worse, but this was the one I just hit): when doing a Google search for Trac "component remove", 7 out of the 10 links returned on the first page are the TracAdmin page from different sites.

Suggestion: If a page is a base documentation page (unless here at trac.edgewall.org), send html headers to instruct searchbots to not index.

<META NAME="ROBOTS" CONTENT="NOINDEX">

How to determine if a page is a base documentation page I am not sure. One possibility:

A page that was last modified by "trac", i.e. a page that the Trac install or upgrade made or modified.
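To make the proposed heuristic concrete, here is a minimal sketch, assuming a page record that exposes the author of its latest version. The `WikiPage` class and the function names are hypothetical stand-ins for illustration only; they are not Trac's actual model or API.

```python
from dataclasses import dataclass

# Author name suggested in this ticket as the marker for stock pages.
STOCK_AUTHOR = "trac"


@dataclass
class WikiPage:
    """Hypothetical stand-in for a wiki page record (not Trac's real class)."""
    name: str
    author: str  # author of the latest version of the page


def is_stock_page(page: WikiPage) -> bool:
    """True if the latest version was written by the install/upgrade user."""
    return page.author == STOCK_AUTHOR


def robots_meta(page: WikiPage) -> str:
    """Return a noindex meta tag for stock pages, else an empty string."""
    if is_stock_page(page):
        return '<meta name="robots" content="noindex">'
    return ""


if __name__ == "__main__":
    # "alice" is an invented author name used only for this example.
    print(repr(robots_meta(WikiPage("TracAdmin", "trac"))))
    print(repr(robots_meta(WikiPage("TracAdmin", "alice"))))
```

With this check, a page stops being treated as stock documentation as soon as anyone other than "trac" saves a new version of it.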

Attachments (1)

trac-no-robots-on-stock-pages.patch (657 bytes) - added by Vaclav Slavik <vslavik@…> 16 years ago.

Change History (7)

in reply to: description comment:1 by mpotter@…, 16 years ago

Replying to mpotter@…:

Suggestion: If a page is a base documentation page (unless here at trac.edgewall.org), send html headers to instruct searchbots to not index.
[…]
One possibility:

A page that was last modified by "trac", i.e. a page that the Trac install or upgrade made or modified.

Actually, that technique of checking for documents modified by "trac" would provide the desired effect of allowing indexing here, since the documents here are not modified by "trac".

comment:2 by Christian Boos, 16 years ago

Keywords:	newhelp consider added
Milestone:	→ next-major-0.1X

Thanks for the suggestion.

by Vaclav Slavik <vslavik@…>, 16 years ago

Attachment:	trac-no-robots-on-stock-pages.patch added

comment:3 by Vaclav Slavik <vslavik@…>, 16 years ago

Cc:	vslavik@… added

The patch above implements this behavior, using the check for the "trac" user.

comment:4 by Christian Boos, 16 years ago

Milestone:	next-major-0.1X → 0.12
Resolution:	→ fixed
Status:	new → closed

Applied in r9137, thanks!

comment:5 by Christian Boos, 16 years ago

Owner:	set to Vaclav Slavik <vslavik@…>

comment:6 by lkraav <leho@…>, 16 years ago

Ingenious, that's all I have to say. I mean, this has been bothering me for a looong bit.
