Building Resilient Systems on AWS: Learn how to design and implement a resilient, highly available, fault-tolerant infrastructure on AWS.

Prevent Robot Indexing with Response Headers

By David Walsh on September 12, 2012

Every so often you have parts of your website that would be better off not indexed by search engines. API calls, search result pages, PDF documents -- all examples of responses which may not have value outside of the current user. No we all know we can signal to the search engines not to index pages using a META tag, but oftentimes service calls and documents don't get the luxury of a META tag. Luckily you can add a header to prevent these responses from being indexed.

The header name is X-Robots-Tag should be easy to add using the server-side language you prefer. For example, adding this header with PHP may look like:

header('X-Robots-Tag: noindex');

If you're using a Django-based python site, the could would look like:

response['X-Robots-Tag'] = 'noindex'

This header can also be set within your .htaccess or httpd configuration files:

<Files ~ "\.pdf$">
  Header set X-Robots-Tag "noindex"
</Files>

The truth is that there's no guarantee that something your server serves wont be indexed by a search engine, but small tweaks like this can ensure your search engine standing can improve and that users don't find their way to "dead" parts of your site via search engines.

Recent Features

By David WalshNovember 7, 2011
Create Spinning Rays with CSS3: Revisited
Last December I wrote a blog post titled Create Spinning Rays with CSS3 Animations & JavaScript where I explained how easy it was to create a spinning rays animation with a bit of CSS and JavaScript. The post became quite popular so I...
By David WalshJune 2, 2015
7 Essential JavaScript Functions
I remember the early days of JavaScript where you needed a simple function for just about everything because the browser vendors implemented features differently, and not just edge features, basic features, like addEventListener and attachEvent. Times have changed but there are still a few functions each developer should...

Incredible Demos

By David WalshFebruary 16, 2012
Use Elements as Background Images with -moz-element
We all know that each browser vendor takes the liberty of implementing their own CSS and JavaScript features, and I'm thankful for that. Mozilla and WebKit have come out with some interesting proprietary CSS properties, and since we all know that cementing standards...
By David WalshJune 1, 2009
Create Custom Events in MooTools 1.2
Javascript has a number of native events like "mouseover," "mouseout", "click", and so on. What if you want to create your own events though? Creating events using MooTools is as easy as it gets. The MooTools JavaScript What's great about creating custom events in MooTools is...

Discussion

Chris
I have a big problem with spam registration on an ExpressionEngine site I help manage. Could this help? I have no development experience, fyi…
Jason Eccles
This a really old post, but for those who are using nGinx instead of Apache, you can do
```
location ~* \.(doc|pdf)$ {
    add_header  X-Robots-Tag "noindex, noarchive, nosnippet";
}
```

Prevent Robot Indexing with Response Headers

Recent Features

Create Spinning Rays with CSS3: Revisited

7 Essential JavaScript Functions

Incredible Demos

Use Elements as Background Images with -moz-element

Create Custom Events in MooTools 1.2

Discussion