AIPREF
AIPREF Generator
Control AI usage of your content
๐Ÿ“–

How AIPREF Works

A complete guide to implementing AI preferences using the IETF AIPREF standard

Overview

AIPREF (AI Preferences) is an IETF standard that allows content owners to declare how automated systems and AI models may use their content. The standard uses the Content-Usage header field to communicate preferences about automated processing, AI training, generative AI training, and search indexing.

Implementation Process

1

Define Your Preferences

Choose preferences for four categories: bots (automated processing), train-ai (AI training), train-genai (generative AI training), and search (search indexing). Each can be set to Allow (y), Disallow (n), or Unstated.

2

Generate Configuration

Use the AIPREF Generator to create properly formatted Content-Usage headers, robots.txt directives, or JSON configuration files that comply with RFC 9651 Structured Fields specification.

3

Deploy Configuration

Add the Content-Usage header to your HTTP responses via web server configuration (Nginx, Apache) or application middleware. Alternatively, add Content-Usage directives to your robots.txt file following RFC 9309 Robots Exclusion Protocol.

How AI Systems Discover Preferences

AI systems and automated crawlers discover your AIPREF preferences through two standard methods:

1. HTTP Headers (Preferred Method)

When a system requests content from your server, the Content-Usage header is included in the HTTP response. This applies to the specific resource being accessed and takes precedence over robots.txt.

HTTP/1.1 200 OK
Content-Type: text/html
Content-Usage: train-ai=n, train-genai=n
...

2. robots.txt Discovery

Before crawling, compliant systems check your robots.txt file at the root of your domain. Content-Usage directives in robots.txt apply to URL patterns specified in the User-Agent section.

User-Agent: *
Allow: /
Content-Usage: train-ai=n, train-genai=n

Preference Conflict Resolution

When multiple Content-Usage preferences apply to the same content, the AIPREF standard defines clear resolution rules:

1. Most Restrictive Wins

When preferences conflict, the most restrictive preference takes precedence. Disallow (n) overrides Allow (y), and explicit preferences override unstated ones.

2. Specific Over General

More specific preference categories take precedence over general ones. For example, train-genai=n overrides train-ai=y for generative AI training.

3. HTTP Header Over robots.txt

Content-Usage headers attached to HTTP responses take precedence over Content-Usage directives in robots.txt for the same resource.

Technical Compliance

AIPREF implementations must comply with established IETF standards:

  • RFC 9651 - Structured Fields for HTTP: Defines syntax for Content-Usage header values
  • RFC 9309 - Robots Exclusion Protocol: Specifies robots.txt format and discovery
  • AIPREF Vocabulary - Defines the four preference categories and their semantics
  • AIPREF Attachment - Specifies how preferences attach to content via headers and robots.txt

Ready to Implement?

Use the AIPREF Generator to create your configuration in minutes.

Go to Generator โ†’

Learn More