############## # Goodbye AI # ############## # Everyone knows AI companies are trying to aquire every bit of txt and jpeg # they can get their grubby robot hands on to train their Large Language # Models (LLMs). Creative types have been trying to fight back by "poisoning" # their art with junk data. # AI companies are so desperate for all of the data, they are claiming that if # a website allows their pages to be findable in a search engine I.E, by # letting google scrape some data to categorize and rank your page, # you are consenting to having your data be trained on. # Here is Hank Green explaining the situation: # https://youtu.be/JiMXb2NkAxQ?si=Qdpx5OyquAqlKYmB&t=960 # So, I have decided to disallow all robots from seeing all of this site # except for the landing page, which is text only ASCII art. # If you want to make it legally harder for AI companies to scrape your data, # all you have to do is have a 'robots.txt' file in the root directory of your # website, and include the following lines: User-agent: * Disallow: /s # this translates to "All robots, you are not allowed in the zone directory" # Or you can use "Disallow: /" if you do not want to have an open landing page. # But be warned, this means search engines can not rank your page... # But they only show legacy media pages now anyway, its not like google would # have ever shown you this site anyway. # ~ WR