Data scraping

Data scraping is the automated process of extracting data from websites using specialized programs or scripts (“scrapers”). Instead of manually opening pages, copying text, and saving it—scrapers do this automatically for thousands of pages. While sounding useful, data scraping has serious legal and ethical implications.

Practical example: Someone creates a script that goes to Amazon, extracts product information (name, price, reviews), and stores it in their own database. Then uses that data to create competing website. That’s data scraping, and in most cases, it’s illegal because Amazon’s Terms of Service prohibit it.

Why data scraping is problematic: (1) Copyright violation—data on pages is often copyright-protected; (2) Terms of Service violation—almost all websites have Terms prohibiting scraping; (3) Server overload—aggressive scraping can disable servers; (4) Intellectual property theft—stealing content like text, images, code; (5) Unfair competition—using data to make competing product.

However, legal forms of data scraping exist: (1) Public data—if data is publicly available and no Terms prohibit scraping, it may be legal; (2) With permission—if website permits scraping (some have APIs for it); (3) Fair use—certain academic or journalistic uses may qualify as “fair use”.

For startups: If needing data, better to request APIs from websites (like Twitter API, Google Places API) instead of scraping. It’s legal, safe, and comes with data agreement.

Contact us

Fill in the form below and our team will contact you as soon as possible. We aim to respond within 24 hours on business days.

Contact - EN

Phone

+381 11 2417 566

Instagram

@naumovicipartneri

Working hours

Mon - Fri: 09 - 17

Email

office@naumovic-partners.com

Linkedin

Naumovic & Partners Law Office