Boris Anthony and Hugh McGuire discuss how much more becomes possible when we open books onto the web.
Take a book down from the shelf, or pick it up off the table. Open it and read a line. With a pencil, underline something, or make a note in the margin. In the moment that we do these things, we use basic technologies—ones we’ve had for a long time. We are also then using the outcome of a great number of other technologies which exist to bring books, shelves and pencils to us. We don’t think of all that when we read a book, glance at our library, or make a note for later. Once a book is ours, we don’t need any of that enabling technology to use it, to have it, to make it part of our life’s experience.
Now, take an ebook. What is an ebook, really? It too relies on an elaborate infrastructure of supporting technologies. But, rather than fade into the background, these technologies intertwine and bind themselves to ebooks, defining and restricting how we experience them. If it weren’t for specialised devices, apps, and e-commerce stores, would we know how to read an ebook? A deep shift is taking place behind a technological smoke screen of business development. We have just begun to untangle the mess of implications this has for the publishing industry, the mediation of knowledge, the structure and tenor of our societies, as well as the experience of reading, sharing, and recalling.
When we possess a book, myriad experiences are ours—the store we bought it from, the friend who recommended it over dinner, the passage that sparked our imagination or changed our lives. The ghostly existence of digital media, often lost deep within clinical software labyrinths not of our making—nor necessarily of our cultural experience—is flat. It leaves no surface or texture for our memory and cognition, developed over millennia to contextualise physical experience, to turn ebooks into the richly ornate totems of life lived that we’re used to.
All we can do is read our ebooks within the narrowly defined environments provided by the technology developer. These environments are constrained into stingy tenant-farmer/landlord-like business models in which we don’t own the ebooks we read, nor our experience of doing so, but merely rent them from the companies who do.
However, rather than restricting our thinking to this realm—which is admittedly fascinating and hard to turn away from—we need to begin charting what by all indications is a vast new world. By bringing ebooks onto the Open Web, they fill with the promise of so much more opportunity and value, for everyone.
Existing attempts to digitise books—PDF, ePub, Kindle—suffer from varying combinations of key limitations. Born as skeuomorphic analogs of their printed counterparts, they tend to be more visually oriented than meaningfully useable, technically complex, or proprietary and locked up. None are networked, or exist in a persistent way on the Open Web; their content cannot be directly accessed, referenced or manipulated.
A Web-based book is not limited in this way. Nor is it simply a webpage or a website. A webbook is a discrete, self-contained collection of structured documents. It is portable, from one system to another, but identifiable as a copy of a specific work, and edition thereof. Once we’ve solved these aspect, we’ll start seeing revisions, annotations, manipulations, derivative works.
The technical needs of web-based books are currently being investigated by such internet and digital standards organisations as the W3C (HTML, HTTP, URI) and the IDPF (ePub). It’s an evolutionary step for a medium whose time has come. *
While driving through the Northern Appalachians of New Hampshire, old friend and Rebus Foundation co-founder, Hugh McGuire and I discussed how open and networked digital book technologies and value flow models could represent a broader array of opportunity, as well as depth of experience, than has been suggested by developments thus far.
* Since writing this, the W3C and the IDPF have combined their efforts, “herald[ing a] future for how we will read, author, publish, and discover content and services with Web technologies”.
Let’s talk about why books on the web are important. Why breaking out of the current ebook reality—with its inaccessible media technologies and locked-down content licenses—is important. To start, we might define an “open web book” as a discrete copy of a web-accessible package of HTML files, text, and other media. It includes structural information and metadata. Such a web book can be collected and managed independently of its producer or provider. We imagine it to follow the age-old model of physical books, where we can take our copy home, put it on our own shelves, into our own library, and read it, without need for too much extraneous technology.
Books are important to me. We hang big parts of our ideas and notions on books in a way we don’t with, say, web pages. So books as an intellectual tool have a representative role in our intellectual lives, and how we think about ourselves. That fact is the driving force for me behind my interest in this. Books, as things that I experience, have an important role in how I think about myself. When we think about ebooks, as people who work with digital technology, we quickly end up asking: why can’t I do more to build on the relationships between myself and ebooks? I’d like to have access to, or build, more tools, and explore these relationships more. Paper is limited as a physical medium; it’s not connected to the network. Ebooks are limited by vendor silos, Digital Rights Management (DRM) lock-in, and what Amazon or Apple will let you do with them. Yet books are a central set of nodes in how I define myself, and I would like to explore my relationship with books using digital tools. But I can’t.
It’s interesting to hear you express how books are “nodes of how you think of yourself.” Books are these things we experience in so many ways. With the physical object, we remember where we bought it, what prompted us to get it. Maybe someone gave it to us as a gift. And that’s just with the acquisition of the book. There are the ideas that the book creates in our minds, the memories of the time when we were reading it, that time when we pulled it off the shelf to show a passage to a visiting friend. These are all experiences, and they add to who we are, to our life experience. As you say, physical books give us all these things, but they are limited, especially for those of us who work with digital, with the web. We know there is so much more that can be done with information. And we turn and look at ebooks, and think, “Well, all I can really do with this is read it.”
You’ve said this before; for certain kinds of books, just reading it is only one small part of what that book represents in your life. There’s so much more. You’ve often used this notion of the ROI, the “return on investment”, of acquiring and owning a book; the idea that there is so much more value than just reading a book once or twice. Ebooks have done two things arguably well: they make it easy to get access to books, and they’ve done a fair job for raw reading. But there are all these other experiences around books that ebooks are not helping us engage with and get value from. And for me this is such a source of frustration: that we could be getting more value, but the model that exists in commercial ebook publishing just does not allow for anyone to build the tools to explore these ideas.
Let’s take as an example what we might call “quantified reading.” Aspects of the act of reading itself that we might capture: what time did I start this reading session, what time did it end, where was I, what have I annotated, and how many people do I know who have annotated this as well? This can all be interesting as a reader, and a company called Readmill tried to do this recently. They ultimately came up against a wall though, not because those people who used the service didn’t love it, but because they were trying to do this within the current model of commercial ebook publishing, with closed technologies and finance-based business models. And so the question becomes, “How do we create an infrastructure where a Readmill can thrive, where a reader can elect to plug into that system without breaking laws?” This simply is not the case right now.
Of course it goes beyond just tracking your behaviour with a book. We can make the text itself—as well as the ebooks and our collections of them— more directly tangible, interface-able, more deeply, cognitively graspable. We can get so much more out of it all than just quickly reading a file. Our hunch is that there is so much more value buried deep in ebooks—in their reading as well as their possession. But before we can explore in any scaled way, we have to build the infrastructure. Because the current ecosystem is all locked up with tech silos, DRM, and protectionist business models.
It’s a strange thing to have spent years working with a concept of what they now call “the Open Web”, building tools around information on an open platform, to then compare that to the current ebook ecosystem. We take this particular class of information, books, and we block it off from accessing and being accessible by that open platform of the web. Blogging for instance, where many of the people involved in creating the information were also building the bits and pieces around it, exploring how we could interact with it in new ways. Coming from that, we’re left with this sense of frustration, that we aren’t able to do that kind of creating, consuming, and building of new experiences with ebooks.
I was one of those people, and having been involved for instance in globalvoices.org, which connected into digital and data journalism. I look at The New York Times now, which since then had a whole department that explored how stories can be told and read differently. A lot of the ethos that came out of blogging went into the journalism world. It made sense, it felt natural, it was ripe for the picking. But I see also a rich cultural overlap, in places like Cambridge and New York, where this community of digital journalists and book hackers intersect. If we said, “Hey, we want to do some cartographic map or timeline or social graph views of the content of books,” they might say, “Oh, yeah, interesting, how can we do that?” Well, first we need some good interesting books and then we need to have those books be part of the web somehow. I sense that these concepts, which have already been integrated into the practices of digital journalism, are now ready to be applied toward digital books as well. It’s time to do this.
It’s an interesting historical parallel, because I think the commercial book world moved the way they did—and built the ebook world the way they did—in direct reaction to the chaos that came to music and journalism with digital. The book industry said, “We can’t go the way of the music industry, where the whole business model exploded because of technology that was out of their control. We can’t go the route of journalism, where everything went online, and then business models started falling apart.” So the book world, shrewdly, said, “OK, we’ll just replicate what we do in the physical world: package things up, make sure everything is encapsulated, sell units through retailers, the way we used to go through Barnes & Noble now we’ll go through Amazon.” That way they managed to keep a supply chain and business model that looks rather similar to what it looked like twenty years ago, in a way that none of the other media businesses have.
That’s the force that we’re running up against, a business model that has been well protected, and well thought out. Our frustration as people who build on top of information is that we could be doing more, delivering so much more value to people.
This is a good place to dive into some notions of where “more” might come from. I mention this often and it’s one of my favourite questions: “When you read, is text all you see?” Of course we see text as our eyes move across it, but our mind sees something of what the author hoped to get across to us. If we’re up to the task, we experience the somewhat magical phenomenon of our imagination recreating those ideas, emotions, notions, and experiences encoded in the text. What can we add to the reading experience that helps us get more directly to that point of communication?
Without distracting from the focus of reading.
And without resorting to forms of attention-grabbing media like video or interactive widgets which do the work for you. The synthesis has to happen in the reader’s mind, and figuring out the tools to do that effectively and delightfully is an exciting challenge.
I read something somewhere recently along the lines of, “Reading is so crazy, we look at little black marks scratched on smushed up wood and start hallucinating.” Reading is such a fundamental part of the way modern society operates that we forget how magical the process is. Part of the problem is we don’t know what those magical things are. There’s a lot of experimentation needed to figure out and “design” ways to make this digital reading experience more valuable.
As a designer, it’s a great challenge to make screen-based experiences which engage not only the reader’s semantic cognition but also their spatial, temporal, and social cognitions, as well as help synthesise all that into new knowledge. We can imagine supporting the absorption of a text by including different mediated views of its content. Some approaches spring to mind that aren’t necessarily new but could be easily generated for any given book. For example a cartographic map of places mentioned, accessible at any moment, with a “we are here now” zoom in. Other visualisations such as explorable social graphs of characters, or timelines—structural and chronological, thematic relationship colouring; all these would visually reinforce the understanding we seek when we read. They could also represent alternative means of navigating a book, beyond the existing structural table of contents.
There’s an argument to be made that maybe books just aren’t that good at doing this kind of thing for some people. That we think reading is important and people should have deeper experiences with books, where maybe they’re perfectly happy not to. Perhaps this also represents a danger. A danger of idealising books.
Idealising the book is not going to get us far. Myself I’m not romantic about it; I don’t swoon at the smell of the ink or artisanal binding crafts. I do however try to understand what makes a book—that discrete, self-contained, and portable package of ideas—as a medium, different and unique from other media. For instance, books have proven resilient across time. An open, text-based ebook and all its metadata—which is also text—can be printed and stored in a vault. Twenty generations later, it can be opened and read without any special technology beyond recognition of the symbols, syntax, and grammar of old languages. Also, text is totally formal. What I mean by that is that finding all the instances of a word in a book is trivial, compared to doing the same in a video. And really, text is itself a foundational part of all our digital technologies. Everything digital is done in text. Before it is “machine readable” it still has to be “human readable.” I think that’s powerful.
There are a few things I’d like in my reading life that would deliver more value to me, such as having the access and control needed to own and manage my annotations and my personal e-library. There are layers, from such obvious things to do, as you mentioned, to the wider world of experiences that could be built and which would deliver more value in ways that we can’t really name right now, especially once network effects take hold. If everyone’s interactions with their copies of a book feed back up to their source—their creator and publisher—they can begin to engage a distributed conversation and exploration.
It’s simply not good enough to just go and build another reader app, another ebook store or stuff like that. A huge part of the work that needs to be done is adding the necessary infrastructure to the existing web to make books first-class citizens on the network, and beyond that: figuring out what equitable value flows and business models can bring all this to life.
Finally, deeply important with this notion of books on the web is that all these ideas we’ve been talking about are still largely abstract and we need a demonstration of real value in these principles and concepts. That can only happen with an available, accessible, and desirable corpus of books that can be explored in these ways, by many engaged readers.