Skip to main content

Scribd has a Youtube Problem

Scribd hasscribd-logo-blk_100x28 a bad reputation (from its early days as a document hosting site) for being a piracy haven, and to combat piracy Scribd has adopted an automated that checks user uploads for pirated content.

Unfortunately that system, like a similar system at Youtube, works a little too well. There are several reports over on the Smashwords blog from Scribd users whose documents were removed from the site by Scribd’s automated system.

When Smashwords signed up to distribute ebooks to Scribd’s ebook subscription service last Fall, one of the side effects of the deal was that ebooks from Smashwords were regarded by Scribd’s system as being an original source.

This has led to more than a few problems. Whenever an author quoted a court document, public domain work, or other legitimately copyright-free document in their book, Scribd logs the quoted text as belonging to the author and their automated system flags and removes any user-uploaded documents that contain the same text.

Several complaints have surfaced on the Smashwords blog over the past couple weeks. For example:

I have a slightly different purview on this whole subject. Many of my document posts on Scribd are 1)historical documents, long out of copyright protection, 2) government or legal documents, not protected by copyright laws, 3) public domain documents in which the author has granted free copy rights to all. I find many times these documents will get taken down by Scribd, and I suspect that is because some author has included quotes from the documents within their copyrighted books. So, even in the world of copyright protection, there are improvements that must be considered. Not everything is black and white.

This type of criticism should sound familiar to anyone who follows copyright news, an in particular Youtube, whose ContentID system is often the focal point of complaints from both media companies and uploaders.

ContentID has had its own share of mistakenly removed videos, and in fact this kind of error gets news coverage about once every other month. The most recent one to cross my news feeds was a report about a video of US Congressional committee hearings being removed as a result of copyright claims by Telemundo and Univision. And only a few months before that, Youtube started taking down user uploaded gameplay videos en masse, even though the videos were arguably fair use and were often encouraged by the game developers.

And those are just the most recent cases; I have been reading about similar issues for over 4 years.

Today’s news about Scribd has only reinforced this blogger’s negative opinion of automated systems like ContentID. They make it far too easy for legit content to be blithely removed without involving even a single person in the decision.

Thanks, Michael!

Similar Articles


Valentine April 2, 2014 um 1:38 pm

Ah yes, gameplay videos. Reminds me of the times, I printed walkthroughs at work on a matrix printer … (~15 years ago).

Felipe Adan Lerma April 3, 2014 um 8:52 pm

I posted this on the SW blog:

…as per Digital Reader, this is not a Scribd / Smashwords isolated problem.

Plus I’ve heard reports from other authors about other distributors having the same problems.

I lost links from over a dozen blog posts, and a widely distributed press release, so I’m not just standing by watching. And I’m still finding books needing replacing.

But, for me, this is stumble steps in something I’ve felt will be very valuable for me (and am seeing some of that evidence already.)

So with that, I wish us ALL the best! 😉


I think your final point in your post "…automated systems..make it far too easy for legit content to be blithely removed without involving even a single person in the decision" is a needed call for refinement.

Maybe a two-step process, with notification for clarification to the author done first before removal. Or the placing of the title in question in some sort of holding folder while notification is done.

Being these automated systems are probably more a part of our digital future than not, it’s time to add a layer of human mediation.

And I know for my part, without Scribd’s author support staff, it’s unlikely I’d be surviving this distribution – to them my deepest thanks!

Scribd One Month Later : 57 Uploads – Problems Fixes and Expectations | Felipe Adan Lerma April 16, 2014 um 9:03 pm

[…] Digital Reader has an excellent article, detailing better than I can, what went wrong.  And evidently, it’s not a Scribd or Smashwords specific problem, even if it did specifically affect me 😉 […]

The Morning Coffee – 17 April 2014 – The Digital Reader April 17, 2014 um 12:30 am

[…] (link), telephone box libraries (link), the rising cost of education in the UK (link), a twist on Scribd’s Youtube problem that I had not considered (link), and […]

Smashwords Details Scribd's New Anti-Piracy Efforts – The Digital Reader May 12, 2014 um 9:10 am

[…] it’s not clear just how many of those copies were actually pirated copies. There have been numerous reports that Scribd has taken down public domain works and works uploaded by their […]

BookID, Book IDunno | Subscription State of Mind May 29, 2014 um 2:51 pm

[…] there was no data on the validity of each of those cases. In my search for that data, I found an article from Nate Hoffelder at The Digital Reader, comparing BookID to YouTube’s ContentID. Scribd’s […]

Amazon May Have "a Serious eBook Theft Problem", But They're Not Alone ⋆ The Digital Reader December 9, 2014 um 12:34 pm

[…] As we all know, Youtube's ContentID system has caught any number of pirates while also punishing innocents. The latter have often lost access to their account and had little recourse. What's more, when Scribd brought their monitoring service online earlier this year, they too made the mistake of yanking content without checking. […]

Juan Lawrence Leynor March 13, 2018 um 12:48 am

Scribd should have people or a system to see if part of the books are in the public domain. Maybe both.

Write a Comment