Researchers Reveal Booxtream’s Digital Watermark DRM

8387608047_9e49150a7b_hThe general details about Booxtream's digital watermark DRM have been known since someone deconstructed a Harry Potter ebook from Pottermore in 2012, but have you ever wondered about the specific technical details?

Thanks to an anonymous hacker going by the name of Paigey the Book Pirate, now we know.

Late last night an email graced my inbox with a link to a file on Pastebin which detailed the various parts of Booxtream digital watermark DRM as used by Verso Books. I can't share that link (it had someone's personal info in it) but I do have a copy of the file for you sans PII.

booxtream.txt

The file is worth a read both for the technical details and for the humor. This is at least the second time I know of that someone has posted a detailed technical analysis of Booxtream DRM, but it is the first to use a humorous tone:

The Institute for Biblio-Immunology specialises in textual pathogen identification and antigen synthesis. Several vials of in vivo samples suffering from a "social DRM" watermarking infection were recently brought to the attention of our cellar scientists. In this, our inaugural communique, we will explore our dissection of said samples and offer an initial expatiation regarding the contaminant undesirables discovered therein, as well as offer preliminary guidance for a successful course of treatment.

...

Prudence tells us that the only time books should be used as weapons of terror is if they are thrown, gleefully aflame, through a publishing conglomerate's window. Instead, we find that the publishing company Verso Books is using books to facilitate the surveillance of readers. By embedding uniquely-identifiable personal information in individual copies of ebooks, Verso (and the company they are relying on for the actual watermarking, BooXtream) are turning vectors for cultural transmission into, effectively, tracking beacons designed to identify who is sharing said ebooks, so as to then neutralise said ostensibly undesirable (by Verso) knowledge transmission paths. This will not stand.

While I don't share Paigey's opinion about the evils of digital watermark DRM, I can appreciate their hard work.

The text file above details seven different ways that Booxtream adds identifiable info to an Epub. (Booxtream can also embed digital watermarks in a Mobi file which can be read on the Kindle, but that is not covered here.)

In addition to adding a unique serial number to the names of files found inside the Epub ebook, Booxtream also embeds the original buyer's name and email on the title page as well as in a footer at the end of each chapter. The digital watermarks can also be found in image metadata and the CSS file, and there's a time stamp which records the specific time the original ebook was downloaded.

All in all, this file is a great read for anyone who wants to know how they are being tracked as well as anyone who wants more details on digital watermark DRM.

It will probably not, however, be very useful for stripping the digital watermarks from an ebook you buy. Booxtream is already aware that some of their technical secrets have been revealed, and they will undoubtedly be taking steps to change how they apply digital watermark DRM.

image y Mark Morgan Trinidad A

About Nate Hoffelder (11598 Articles)
Nate Hoffelder is the founder and editor of The Digital Reader:"I've been into reading ebooks since forever, but I only got my first ereader in July 2007. Everything quickly spiraled out of control from there. Before I started this blog in January 2010 I covered ebooks, ebook readers, and digital publishing for about 2 years as a part of MobileRead Forums. It's a great community, and being a member is a joy. But I thought I could make something out of how I covered the news for MobileRead, so I started this blog."

23 Comments on Researchers Reveal Booxtream’s Digital Watermark DRM

  1. We started using another watermark DRM from Legimi – piracy in Poland undoubtedly is very high, that’s why we trusted experts who tested it on their domestic market. And we find it very clever underneath. It’s not an advertisment, I just recomend it because it’s good + they support audio and PDF files.
    http://biz.legimi.com/en/services/watermark/

  2. I am just sick and tired of people using the term “DRM” to suit their ignorance and/or agendas. Booxtream may have followed the lead of Bill McCoy of IDPF in calling its technology “social DRM,” but that’s about as far as it goes. Other vendors of e-book watermarking technology, such as Legimi (see above) and Digimarc, take pains to distinguish their technologies from actual DRM. You are doing no one any favors by misusing the term.

    Come on, even the EFF knows better than this (https://www.eff.org/press/mentions/2008/1/11-0).

    • I describe it as DRM because there is a whole host of publishers (Pottermore, and a bunch in Germany) who use digital watermarks when selling direct and harder forms of DRM on ebooks sold through Kindle, iBooks, etc. They are saying with their actions that the two forms, digital watermarks and encryption DRM, are equivalent.

      And given that there is a cost for applying digital watermarks or encryption DRM, I’d say they have more in common with each other than digital watermarks have with DRM-free.

      • Riiiiight, Nate.

        So by that rationale, music labels still use DRM on iTunes and Amazon because they use a form of DRM on Spotify. And Hollywood studios use DRM for late-window movies on free-to-air TV (and free streaming services like Crackle) because they use DRM on DVDs. So do all the self-published authors who give e-books away on their own websites while selling with DRM on Amazon. It’s all DRM, right Nate?

        Seriously. Did you just invent this now, or has that been your rationale all along?

  3. Oh, and O’Reilly uses (proprietary) watermarking on their PDFs. I’m sure they be thrilled to hear that someone is calling this technology “DRM.”

  4. The watermarking is applied for the purpose of helping publishers manage their works’ digital rights…so how, then is it not DRM? For all the noise anti-DRM advocates make, DRM actually does have an actual real acronym expansion, and the middle word in that one isn’t “restrictions.”

    • OK, so a copyright notice in an e-book (which is not required by law for a work to be protected by copyright) is DRM too, right?

      • The difference is copyright notices don’t attempt to actually enforce rights, just tell you about them in the hopes that you will follow them.
        Digital watermarks are supposed to be used to enforce the copyright holders rights, by allowing them to identify you if you start giving the digital item to pirates.

        • Right but not all watermarks have user identifying information. Such as the ones that Nate was confused/misinformed about in the music industry. The music industry has never (other than small scale experiments) used what we call session-based or transaction watermarks, just static watermarks that identify the retailer (e.g. iTunes), not the end customer.

          There are lots of ways to enforce copyrights without watermarks or (actual) DRM. Too many to enumerate here. That’s why calling something DRM that isn’t DRM is silly, or in this case, irresponsible.

          • “The music industry has never (other than small scale experiments) used what we call session-based or transaction watermarks, just static watermarks that identify the retailer (e.g. iTunes), not the end customer.”

            Oh, really?

            http://www.mattmontag.com/music/universals-audible-watermark

          • (to comment below about UMG’s music watermark): yes, really.

            That post is 100% consistent with what I said. It just says, “There’s a watermark.” It doesn’t say what data is embedded in the watermark. Like I said, it’s an identifier for the retailer. That’s another thing that Nate got wrong about music watermarks. In order to insert a watermark with user information, the retailer has to do it at transaction time. UMG’s watermarks are inserted by UMG, not by Apple or Amazon. UMG wanted to see if it could learn anything about whether unauthorized copies of its files tended to come from iTunes, Amazon, some other retailer, or none of the above; that’s why it inserts static retailer IDs as watermarks.

          • I’m sorry, but you are consistently factually wrong. Please do not spread incorrect information.

            UMG’s watermarks are not inserted by UMG, but by MarkRef, a third party vendor which develops–you guessed it–session-based audio watermarking tech.

            Some references for you:

            http://www.markany.com/eng/wp-content/uploads/library/brochure/1_Introduction_to_MarkAny.pdf

            https://www.google.com/patents/US20120255029

          • OK, they are inserted *on behalf of* UMG, not by UMG itself. This is a trivial distinction. Otherwise, nope. It’s not a session based watermark. It would have to be inserted by the retailer. It’s not. Your “sources” are a brochure stating the capabilities of the Korean company whose watermarking technology UMG is using. MarkAny is capable of doing session based watermarks. This is not one of them. The patent you cite is also irrelevant. UMG may have wanted to get retailers to insert session-based watermarks, but they wouldn’t, so UMG settled for a static watermark that it inserted (OK, had inserted on its behalf) into files being sent *to retailers*.

            Look, I actually know the people involved, and I know what they did and what they did not do. You don’t, obviously.

          • “The music industry has never (other than small scale experiments) used what we call session-based or transaction watermarks, just static watermarks that identify the retailer (e.g. iTunes), not the end customer. ”
            If that’s true, then I would agree calling those watermarks DRM is taking it a bit far.

            That doesn’t mean that the types of DRM being used in ebooks aren’t DRM.

  5. What happens if your Kindle gets stolen, DropBox hacked, etc.?
    If a file you bought gets stolen and ends up being posted online for pirates, does law enforcement end up knocking on your door?

  6. Sort of. Not law enforcement but a nastygram sent by the monitoring agency, if any, that the publisher has engaged to search whatever site it is for those watermarks. (Booxtream only does the watermarking; they don’t do the monitoring. Publishers that use Booxtream have to use another service to do that.) And the same thing would happen if the best friend to whom you emailed a copy of the file does the same thing.

    • So how can digital watermarks actually be useful in curbing piracy?
      Wouldn’t anyone trying to enforce it actually have to prove that the file wasn’t stolen from you?

  7. Not necessarily. Those are evidentiary questions that tend to be handled (literally) on a case by case basis. The one thing that’s for sure is that you’d need to hire a lawyer and file a lawsuit in order to find out the answer.

    • It does make me worried that some innocent people will end up having to pay a fine or something because they don’t have time to fight it, and/or can’t afford a good lawyer.

  8. Nate is far from the only one that considers Watermarking a form of DRM. It is in fact the mainstream definition of the technology.
    A simple internet search for “DRM watermarking” will confirm it, coughing up a zillion technical reports and academoc papers. Like this one:

    http://www.igi-global.com/chapter/watermarking-techniques-drm-applications/8495

    If anything, it is the deniers that are making it up.

    • No.

      Looks like that’s exactly what you did: a “simple internet search.” If you were to actually read the paper you cite, you’d see that it treats watermarking and DRM as separate things. To wit: “Finally, the use of watermarking systems in the framework of a DRM is deeply analyzed.”

      It’s possible to combine watermarking and DRM, as in the first SDMI system back around 1999, which included a DRM system that forced the reading of a watermark. But they are separate technologies.

Leave a comment

Your email address will not be published.


*