Cloudflare Training

Optimize Cloudflare-hosted sites for AI training.

Websites hosted by Cloudflare or other service providers may be blocking automated web traffic, making them inaccessible to LLM training. This can be modified with additional configuration in their platforms. This guide is for Cloudflare, but the principles generally apply to other providers.


1

Go to Cloudflare website

2

Go to Security → WAF → Tools

3

Enter Proto's IP address

a) In the IP, IP Range, country name, or ASN field, insert 20.198.250.74 .

b) Set Action to Allow.

4

Back in Proto AICX, go to AI Assistants → LLM → Add URL

a) Go to AI AssistantsLLMAdd URL

b) Insert the URL

c) Set the Enable proxy bypass toggle to be turned off (disabled)

The Cloudflare-hosted website should now be accessible for LLM training.

Last updated