Web scraping is that activity where a party uses automated software to crawl the internet and copy data and other content, usually so that it can compile that together and make its own product offering. This may be of concern to you because you are a company that does web scraping. Or you may be a web publisher and there are other parties that are scraping your content. Let’s examine some of the legal issues around web scraping.
Breach of contract
One of the questions that commonly arises around web scraping is whether the activity is a breach of contract. More specifically, the question is whether the use of automated software violates the terms of service of the website that is being scraped. You often see website terms of service prohibit the use of spider and other automated crawling software to access and use the site. Parties who own websites that are being scraped will often look to see whether the scraping of their site is a breach of contract.
Copyright infringement
Another common question arising when analyzing web scraping is lawful whether scraping constitutes copyright infringement. This is a difficult argument to make if all that is being scraped is data, because mere facts usually are not subject to copyright protection. But if there is other content being scraped, such as images or specific compilations of data, the question of copyright infringement becomes a bit easier to answer in that unauthorized copying is an likely an infringement.
Computer Fraud and Abuse Act
The Computer Fraud and Abuse Act is another topic that often comes up in discussions about web scraping. This is a federal law that makes it unlawful for a person to access a protected computer without authorization, or in excess of a specific authorization. So the question becomes whether that access by the automated web scraper violates the Computer Fraud and Abuse Act. There are some important things that have to be proven for a plaintiff to succeed under the Computer Fraud and Abuse Act, and one of those is loss or damage that results from the unauthorized access. It is a very fact intensive inquiry that has to be made, but the Computer Fraud and Abuse Act is one thing that parties should think about in the context of web scraping.
Trade secrets
The question of trade secrets is another good one to raise in the context of web scraping. A trade secret is any information that a company has that gives that company a commercial advantage in the marketplace because it is secret. The information also has to be the subject of protective efforts — the company has to try to keep the information secret. For example, if information on a website is put there in a way that is behind certain protective barrier,s and the party doing the scraping circumvents those barriers, it could be that there is a misappropriation of trade secrets, particularly if that information is used for some competitive purpose.
Let’s talk
Web scraping legal issues can be complex. Scraping presents certain legal risks to the ones doing it, and the law provides certain powerful remedies when web scraping runs afoul of the rules. If you have questions about web scraping, give me a call at (630) 362-7237, or send me an email at ebrown@internetcases.com.
About the author
Evan Brown is a technology and intellectual property attorney in Chicago. This content originally appeared on evan.law.