Robert R George 
							
						 
					 
					
						
						
							
						
						a98a8ee93f 
					 
					
						
						
							
							Update robots.txt to prevent crawling of domain blocks ( #26470 )  
						
						 
						
						... 
						
						
						
						Co-authored-by: Claire <claire.github-309c@sitedethib.com > 
						
						
					 
					
						2024-12-02 08:03:24 +00:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Foritus 
							
						 
					 
					
						
						
							
						
						405f141fe0 
					 
					
						
						
							
							Change: Block GPTBot ( #26396 )  
						
						 
						
						
						
						
					 
					
						2023-08-09 11:58:46 +02:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								ThibG 
							
						 
					 
					
						
						
							
						
						c4f2433300 
					 
					
						
						
							
							Disallow robots from indexing /interact/ ( #10666 )  
						
						 
						
						... 
						
						
						
						This does not provide any new information and may just triple the number
of crawled pages 
						
						
					 
					
						2019-05-02 00:10:19 +02:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								nightpool 
							
						 
					 
					
						
						
							
						
						a5992e5883 
					 
					
						
						
							
							Change robots.txt to exclude only media proxy URLs ( #10038 )  
						
						 
						
						... 
						
						
						
						* Revert "Change robots.txt to exclude some URLs (#10037 )"
This reverts commit 80161f4351 .
* Let's block media_proxy
/media_proxy/ is a dynamic route used for requesting uncached media, so it's
probably bad to let crawlers use it
* misleading comment 
						
						
					 
					
						2019-02-14 03:11:47 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eugen Rochko 
							
						 
					 
					
						
						
							
						
						80161f4351 
					 
					
						
						
							
							Change robots.txt to exclude some URLs ( #10037 )  
						
						 
						
						... 
						
						
						
						- Exclude static assets
- Exclude uploaded files
- Exclude alternate versions of the profile page
- Exclude media proxy URLs 
						
						
					 
					
						2019-02-13 21:28:18 +01:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eugen Rochko 
							
						 
					 
					
						
						
							
						
						9c4856bdb1 
					 
					
						
						
							
							Initial commit  
						
						 
						
						
						
						
					 
					
						2016-02-20 22:53:20 +01:00