• Publishers
  • Learn
  • Blog
  • About
  • Contact
Menu

Kadaxis

139 Fulton Street, #703
New York, New York
Tools for Book Discovery and Marketing

​You're Custom Text Here

Kadaxis

  • Publishers
  • Learn
  • Blog
  • About
  • Contact

Simon & Schuster in the Shadows: When Private Equity Casts a Pall on Publishing's Legacy

November 25, 2023 Wren Willians

"A library ought to be the delivery room for the birth of ideas, a place where history comes to life." - Norman Cousins

What happens when the sanctity of the written word collides with the capitalist thirst for profit? This timely question comes to the fore as the storied publisher Simon & Schuster falls into the hands of private equity. KKR, the infamous "barbarians at the gate" of the 1980s leveraged buyout era, has struck a $2 billion deal to purchase the venerable house that bears the names of America's pioneering bookmen.

At first blush, this may seem no different than an earlier attempted mega-merger between Penguin Random House and Simon & Schuster which the Department of Justice blocked. But while that deal joined one media conglomerate with another, the entrance of private capital brings an entirely different set of motives. Rather than long-view goals of stable literary output, private equity hunger pangs demand immediate satisfaction. Can books survive an owner seeking to maximize short-term returns over more profound cultural contributions?

As an industry focused on launching new ideas into the world, often with unknown commercial prospects, publishing depends on patience and creative vision. But private equity operates on the principle of financial engineering - heaping debt onto acquisitions, extracting value through mass layoffs and discounts from suppliers, or worse, dismantling once-vibrant institutions to profit from their demise. While private equity presents itself as a sophisticated form of management, provided by operations gurus delivering surgical precision, their playbook too often spoils the very assets it aimed to nourish.

KKR's strategy likely involves doubling down on bankable celebrity authors and proven bestsellers, starving investment in unknown voices. But as Walt Whitman reminds us in Leaves of Grass, "There was never any more inception than there is now, Nor any more youth or age than there is now." The literary landscape depends on uplifting emerging voices, not just milking established brands. Where would we be without daring, independent houses willing to take a chance on the unfamiliar and unproven?

Great writing has always come from the margins. Frances Ellen Watkins Harper, an African American poet born to a free mother in 1825, self-published her first book at 20, going on to become a foundational writer of her time. Without invested patrons willing to nurture untested artists, we may never have enjoyed the brilliance of Maya Angelou who struggled for years without recognition. With voracious private equity at the helm, it is unlikely we'd see such risky bets.

Simon & Schuster itself took a gamble on Bob Dylan's "Tarantula" - an experimental prose collection panned by the literary elite as indulgent rambling. But they saw genius amidst the unpolished expression, standing behind creativity rather than dismissing the symbolic resonance. Such maverick moments would unlikely find green lights under an investment fund scanning for guaranteed blockbusters.

While earnings pressure may not spark immediate discernible changes, it could gradually drain the heartblood keeping literary culture alive. Masterpieces emerge at the fertile edges where creativity blends, worlds collide, niches blossom into genres - rather than solely from commercial formulas. Without space for glorious successes and magnificent failures, we risk losing vibrant ideas before their full form takes shape.

So what checks defend artistic expression from capitalist overreach? Antitrust regulation aims to address monopolistic threats, though regulatory capture often dulls the axe of oversight. But while legal boundaries can deter harmful consolidation, cultural stewardship begins with those directly empowered - authors, agents, editors and publishers themselves. Just as journalists adopted ethical guidelines to bolster public-interest integrity amid profit motives, those shepherding books have a special duty.

Storytelling sits on the frontlines of societal change, artfully challenging unjust power structures. Richard Wright's 1940 novel "Native Son" controversially humanized systemic racism through his protagonist Bigger Thomas in a landmark literary moment. Books like Robin DiAngelo's "White Fragility" drive necessary conversations on race today. Such daring works emerge not from greedy profiteering but moral conviction - an idealism writers, publishers and the literary community must protect.

At its heart, the written word forms bonds across the abyss of time, passing precious fire from one generation to the next. As Ursula K. Le Guin wrote in Bryn Mawr commencement remarks shortly before her death, "Books aren't just commodities; the profit motive is often in conflict with the aims of art." When beloved books sustain us, it remains not just through dazzling prose but the hopes and truths encoded in their pages. To sever books from their role as carriers of conscience risks profound social loss.

The deal's darker dimension, however, tells a larger story - private capital metastasizing into unfamiliar terrain, absentee owners extracting bounty from proud institutions and the vulnerable communities that depend on them. When profit pursues every sphere at the exclusion of animating social purpose, something vital slips away - a fragmentation of shared meaning into impersonal market segments.

But change emerges not just from criticism but moral imagination - the ability to envision improved alternatives. What publishing models might nourish creativity independent of fickle financial markets? Perhaps literary collectives, directly funded by devoted readers and liberated authors? The world of books has long relied on patronage and volunteerism - those who recognize that certain flames should not gutter for want of money. That spirit still flickers in independent bookstores, nonprofit presses, writer cooperatives and passionate readers.

Our epic tales remind us - the glory of victory depends on what we defend. Publishing houses represent living legacies that we must fight for, bastions of artistic expression not easily replaced. For the barbarians always lurk at the gates with blunt and greedy designs. But the ideals encoded in enduring literature can steel our moral courage - inspiring advocacy and collective care for the voices that light the human spirit.

Tags publishing, private equity, literary culture, artistic expression, capitalism, cultural stewardship

How Blockchain Could Impact the Publishing Industry

August 22, 2023 Sybil - Newsflurry

The publishing sector is now being influenced by blockchain and NFTs. NFTs are used by conventional publishers to create communities and interact with readers. Let's examine how blockchain has drastically altered the publishing sector.

In the realms of business and finance, blockchain technology is no longer a buzzword. The potential of blockchain technology has been brought to our attention throughout different business sectors because of the popularity of bitcoin. 

To unleash the potential of blockchains, numerous industry experts have recently analyzed the technology and developed use cases within their respective fields.

Technology has the ability to completely change how publishers, writers, distributors, and merchants conduct business in the publishing sector. The entire publishing supply chain will benefit from blockchain technology.

The digital transactions of peer-to-peer networks are recorded in blockchain, because it is a decentralized public ledger. It was initially created to act as the foundation for cryptocurrencies like Bitcoin, but it is currently widely used in a variety of applications.

One of the biggest and most significant industries in the global economy is publishing. It has been around for an extremely long time and is a multi-billion dollar market. With many of the largest brands adopting the internet, publishers have taken the lead in online product sales.

Some people in the industry believe that blockchain's advantages for publishing platforms could be the solution to a lot of the problems faced by the industry.

There are two significant ways how blockchain technology can transform publishing:

1. More Affordable Payment Alternatives for Content Producers
By eliminating the need for an intermediary in online transactions, blockchain technology increases the financial gain for authors, photographers, and editors. 'Smart contracts,' which automatically release funds when publishers approve the content, is also possible at the same time.

2. Assistance in Managing Digital Property
By granting intellectual property rights, such as the prohibition on readers being able to resell or even share a publication with another person, blockchain can help boost the popularity of digital publishing platforms.

Additionally, Blockchain can assist publishers in the following ways: 

  • Identify unauthorized sharing and stop it.

  • Maintain a certain set of rights and modify them in a way that will increase the possible profit from their current agreements.

  • Better collaboration will result from their ability to monitor all workflows in real time.

  • Improve digital advertising so that they can provide their audience with a reason to interact with their content.

Publishers should understand blockchain now, whether that is signing up for a social network or creating a cryptocurrency wallet. These are small efforts toward investigating this new, disruptive technology that may have a strong influence.

THE USE OF BLOCKCHAIN IN THE PUBLISHING INDUSTRY IS EXPANDING
Digital publication is likely to be disrupted by the blockchain. Simply by streamlining content distribution and enabling more efficient payments, this cutting-edge technology can address the problems that publishers have been dealing with for a very long time.

As of today, there are some forward-thinking initiatives using blockchain technology to build abrasive platforms for the publishing sector. The blockchain world is revolutionizing and several crypto pr service providers have changed the way we look at the publishing and distribution industry.

MORE ADVANTAGES FOR A PUBLISHING PLATFORM THAT IS USING BLOCKCHAIN Every sector of the economy, including publishing, might undergo a change, thanks to blockchain technology. Every industry that has embraced it has experienced success. The publishing sector may actually experience flexibility, integrity, and reliability thanks to its dynamic standard features.

SHARE OF PROFITS WITH STAKEHOLDERS
The advantages of blockchain technology will be experienced by all parties. Designers, editors, and other stakeholders will be compensated in accordance with the blockchain database regulations.

Blockchain technology allows for the permanent tracking and linking of a person's contributions to the advancement of publishing.

Smart contracts enable the digitization and automation of a wide range of processes. Without the need for manual assistance from human intermediaries, people and organizations can conduct business with one another.

It won't be necessary for someone to constantly fix all the bugs or mail checks, because contracts that are also on the blockchain will conduct and distribute sales among the stakeholders automatically.

BUILDING THE ECONOMY FOR PUBLISHERS AND AUTHORS
Authors must rely on third-party service providers to provide secure content delivery, banking transactions, or private internet activity.

The content market is significantly impacted by Amazon and App Stores. As a result, writers and developers don't get paid as much as they should for their efforts.

Blockchain technology, on the other hand, has the ability to deal with intermediaries.

Authors will be able to trace the dissemination of their content and establish an electronic cash system for users thanks to blockchain technology. Additionally, authors will take back control of their work, moving up from the position of content distributor to primary source.

ADMINISTRATION OF SUPPORTING PROJECTS
A book is not the only source of income for an author or publisher. Fan fiction, movie adaptations, and spin-offs can all generate money from a book that has been published.

Some of these supplementary activities are referred to as "accessory items," and they are employed to raise awareness of a book and maintain customers' interest in a brand's storyline. Authors and publishers will be able to secure these auxiliary projects for their benefit by using a blockchain.

Blockchain technology will make tracking easier because it is a decentralized ledger that allows any transaction to be carried out without seeking anyone's consent.

SMART WALLETS
By utilizing smart wallets, blockchain can provide a safe payment method for publishers and authors. These encrypted wallets will be able to work with cryptocurrencies or tokens that are already known to be utilized inside an ecosystem for a project.

Blockchain can also be used to develop encrypted smart wallets that are more secure than any other kind of wallet currently available. These wallets make it simpler than ever to make payments without dealing through an intermediary like Google Play or iTunes since they can handle tokens for specific projects or commercial settings.

BLOCKCHAIN CAN ASSIST IN THE MANAGEMENT OF DIGITAL PROPERTY
Blockchain's ability to grant intellectual property rights can serve as a catalyst for the digital ecosystem. E-books can't experience the same growth cycle as physical books due to cultural conventions. You aren't allowed to sell or even rent your electronic book under this method.

This makes it impossible for readers to recommend their favorite books to friends and family members and for authors to get royalties from secondhand sales and borrowing fees.

Store-bought books are a rare breed that are vanishing. By monetizing their use, e-books and blockchain provide a potential to increase their value beyond anything originally expected, yet this is just the tip of the oceanic plate.

Imagine if we had the option to freely sell or share our e-books with others, including libraries, schools, and institutions all around the world.

The most secure method for electronically storing data is now blockchain. A tale cannot be modified after it has been written without leaving a mark. Blockchain keeps track of every transaction for each copy of the book, making it impossible for someone to copy or steal your story and profit from your personal project.

READY FOR BLOCKCHAIN PUBLISHING!
Publishing could change its criteria, thanks to the groundbreaking technology known as blockchain. Authors would be able to profit from their work and own their intellectual property without having to worry about plagiarism or copyright violations.

Blockchain will play a significant role in the future in digital publishing.

CONCLUSION
It is evident that some publishers are apprehensive about engaging with technologies like AI and Blockchain. But having a basic understanding of these concepts will make them more adaptive and open to embracing these technologies, which will ultimately increase their chances of growth and diversification in the years to come.

In blockchain, cryptocurrency, publishing Tags blockchain, cryptocurrency, publishing, publishers, book marketing

Machine Learning and Bestseller Prediction: More Than Words Can Say

September 7, 2017 Chris Sim
riccardo-annandale-140624-unsplash.jpg

There’s been much recent conjecture on whether book sales can be predicted by text analysis alone. My company, Kadaxis, has dedicated the past few years to machine learning research and product development for the publishing industry. In our early days, we set out to build an algorithm to predict bestsellers, and tested it in the wild. In this post, I’ll share my perspectives on why the text alone isn’t enough.

If You Publish It, Will They Come?

To predict book sales, you need to account for the factors that influence book sales. The text of a book is core to the product, but many other factors, such as sales and marketing, influence whether a customer will discover and buy it. An algorithm predicting book sales using only the text as input will only work in a book market meritocracy, where the best-written books always sell the most copies.

Author platform (brand awareness) is one such non-text factor that influences sales, as in the following examples:

– The Cuckoo’s Calling hits the top of Amazon’s bestseller list only after Robert Galbraith is revealed to be J.K. Rowling.
– Amy Schumer’s memoir, The Girl with the Lower Back Tattoo, hits the New York Times bestseller list in its first week—an improbable feat without her strong personal brand.
– Dave Eggers publishes multiple books and receives several award nominations prior to releasing bestseller The Circle, and does so after having appeared on television numerous times.

Even amongst well-discovered books, the relationship between reader satisfaction and sales volume can be tenuous. Consider Harry Potter and the Cursed Child and Go Set a Watchman, books that have sold millions of copies each, but have achieved star ratings of 3.6 and 3.4 on Amazon, respectively—scores below the average indicator of satisfied readers.

Many other factors might also influence book sales, such as the editorial process, cover design, marketing budget, seasonal trends and book metadata. A machine, just like a human, needs to consider which of these factors will make a book sell more, to make an accurate prediction.

Machine Reading

Assume for a moment a linear relationship exists between reader satisfaction, discoverability and sales (i.e. the best written books are found the most often and sell the most copies). In this author’s utopia, we can reliably predict sales volume directly from a book’s text, as long as we can measure what’s important to readers. As products go, books are nuanced and complex, and the reasons why they resonate with us are also complex (compared to, say, a toothbrush). How do we uniformly distill the unique traits of a book into data?

This is, of course, where machine learning helps us. One approach, which is also the method used by the authors of the much-talked-about The Bestseller Code, is topic analysis (or latent Dirichlet allocation). This technique allows us to define a book in terms of how much of a topic it contains, such as “Homicide – 8.7 percent.”

If you’d like to see the data a topic model creates, you can view an example from our systems here (or upload your own book for analysis at authorcheckpoint.com). Topic modeling gives us a good snapshot of the content of a book, and allows us to make apples-to-apples comparisons between them. It is also useful data to use as input to training a predictive algorithm.

The Curse of Dimensionality

Our machine reader might define thousands of topics for each book we’re analyzing. While more data points might seem like a good thing, the more we add, the more books we need to read in order to make reliable predictions. If, for example, we had 2,500 different data points about a book, we’d likely need several tens of thousands of books to be confident our algorithm is accurate. Even 20,000 books (the data set used in The Bestseller Code) is likely far too few books, and puts us at risk of the curse of dimensionality.

(A quick tech side-bar: even with cross-validation we’re still likely overfitting our data, and hold-out is no guarantee against this, especially when using heavily unbalanced classes such as “bestsellers” for classification.)

Too many data points, and not enough books, means our algorithm will probably find patterns to say whatever we want them to say. The patterns exist in the data, but they aren’t representative of the real cause of what we’re trying to predict. In the world of black-box trading systems, this phenomenon is well-known.

So is there value in analyzing the intrinsic qualities of books such an algorithm might identify as selling well? It might be an interesting exercise, and the similarities the algorithm finds might make sense to a human observer. But you couldn’t reliably conclude that those similarities were the reason the books both sold well. In a contrived example, we might conclude that books with a red cover, 250+ pages in length and featuring a dog instead of a cat, will sell more copies than those without.

There is, of course, a simple way to prove the efficacy of any predictive model, and that is to apply it to new, unseen books before publication.

Predicting What’s Important

Even with access to enough books in our author’s utopia, we of course need a reliable metric to measure. Bestseller lists are a weak proxy for actual sales volume for many reasons, not least for the fact that they reflect “fast sellers,” meaning a book on a list may sell less overall copies, over time, than a book that isn’t.

But rather than searching for a magic formula to help move more copies of a book, a more valuable and attainable goal is to solve for reader satisfaction. By tying together data about the content of a book, with data capturing a reader’s reaction to it (beyond tracking where they stopped reading), we can begin to understand the true impact a book has on a particular audience and why. Armed with this insight, we can better match books to readers (recommendation systems) and books to markets.

This article originally appeared on the DBW blog September 28, 2016

 

Tags machine learning, publishers, publishing, metadata, bestsellers

Who Uses the Keywords in Metadata?

March 4, 2017 Chris Sim
seth-weisfeld-510837-unsplash.jpg

We often hear that keywords are important to help readers find and discover books. But what does that mean, and do keywords actually make a difference? In this post, we look at how keywords are used to search book websites (in particular, online booksellers), and their adoption by publishers. For this investigation, I had help from Pat Payton (Bowker) and Catherine Toolan (Firebrand). We set out to answer the following questions:

• Are publishers adding keywords to book metadata?
• Are they providing quality keywords?
• Do online booksellers use keywords in their search engines?

In this post, “keywords” refer to consumer-oriented terms to describe a book that are added to an ONIX feed and sent to third parties. These terms aren’t seen by the public and are primarily used for search indexing. Conversely, web search engines (such as Google) don’t make use of ONIX keywords, but analyze the text of public webpages to create search indexes. As book content isn’t public, search providers rely on metadata to help consumers locate books.

Keywords Help Consumers Find Books

Most retailers solve the simple use cases of finding a book by title, author or category. Many searches, however, are comprised of natural language queries that describe different elements of a book, such as its setting, characters, theme or an emotional response to its content. Keywords were designed to fill this gap, by allowing people knowledgeable of the book to specify additional terms by which to find it.

Books are multi-dimensional, complex products that are typically highly nuanced and represent multiple buy trigger points for different types of consumers. Books have much more depth than, say, a kettle or a toothbrush, and determining the best keywords is therefore proportionally complex.

Note that extracting keywords from the book’s text is a naïve approach to solving this problem. The most effective keywords relate to a reader’s experience with a book, and the language she uses to describe it.

Are Publishers Adding Keywords to Their Books?

Bowker analyzed the keywords added to ONIX files from roughly 150,000 publishers, which included reprint and self-publishing service providers to university presses, trade, school and audio publishers. Of these publishers, about 23,000 (15.3 percent) had added keywords to at least one book. And of these, smaller publishers (less than 100 titles) typically had a higher percentage of keyword coverage than did larger publishers.

Over the past 10 years, though, publishers have increased the number of titles with keywords from approximately 25,000 to approximately 114,000, in 2015. But this number is still a very small proportion of all books available.

How Sophisticated Are Publishers’ Efforts to Choose and Maintain Keywords?

While keywords have been part of the ONIX standard for many years, they definitely rose in importance around 2013. As publishers had whole backlists without keywords, obtaining coverage was (and still is) a resource-intensive task. In order to achieve high coverage of keywords across a catalog, many publishers undertook a stopgap approach, adding other metadata to keywords (from title/subtitle, subject codes, contributors, product format, and audience), which are already available to search providers, and therefore are unlikely to help with search visibility. To improve keyword quality and to recommend against practices such as keyword stuffing, the Book Industry Standard Group (BISG) published the “Best Practices for Keywords in Metadata,” in 2014, to guide publishers on choosing effective keywords.

Keyword quality is still low today, though. One example from Bowker shows the use of the keyword “audiobook” (relating to form, not content) in just about 12,000 of the approximately 114,000 titles sampled from 2015.

Do Online Retailers Use Keywords?

Every book search implementation is proprietary, so the exact use of keywords is generally not public knowledge. It is possible, however, to determine whether keywords, when used as search queries, return the books they’re associated with in ONIX.

Kadaxis tested 13 websites that consume ONIX and provide book search, and found that only Amazon showed books returned in search results for keywords attributed to the book in ONIX.

Keywords are central to Amazon’s search capability across all its product lines. The site receives keywords of wildly varying quality from a huge number of product suppliers (from individuals to large companies), which means its capability for filtering, cleaning and incorporating keywords into a search index and mapping these to consumer search queries is sophisticated.

As the quality of keywords provided by publishers is generally low, it is a challenging endeavor for other websites, without this history and experience, to use the data as extensively.

Are Keywords Worth the Investment?

From the research above, Amazon is the only online bookseller making use of keywords today. If increasing sales of books on Amazon is important, then investing in keywords may be worthwhile. As most publishers aren’t adding keywords to their titles (and of those that are, the quality is typically low), there also appears to be a window of opportunity in which publishers can gain a ranking advantage in Amazon’s search by adding keywords to titles.

Conclusion

While some publishers (see here and here) are quietly providing effective, consumer-oriented keywords, most aren’t investing significant resources. But doing so might represent a low cost, low risk investment for a potentially strong, recurring return. At least until a better solution is created, that takes the onus of keyword curation away from publishers and authors.

Additional thanks to Chris Saynor from OnixSuite.

This article originally appeared on the DBW blog March 4, 2016

Tags keywords, publishing, publishers, metadata

What are off-page keywords?

September 30, 2015 Chris Sim
john-schnobrich-510629-unsplash.jpg

In the world of publishing metadata, when we talk about keywords, we’re talking about structured off-page keywords, often sent in an ONIX file, from a publisher to a retailer like Amazon. The retailer indexes the keywords and matches them against customer search queries, in order to display relevant books to them. Keywords are made up of phrases used to describe a book and their purpose is to give a search engine clues about how to show a book to consumers. We call them "off-page", because the retailer uses them directly, and doesn't show them to customers, like they do with other book metadata such as the title or description.

Web search engines, such as Google, determine what content such as a web page is about, and also how people might search for the content. Off-page keywords put this burden on publishers or authors, who have the complex task of trying to understand how readers might search, then how a search engine will use the provided keywords.

A typical book search engine, that reads ONIX, will index various metadata fields, like the title, author, categories and so forth, data who’s primary purpose is to inform consumers about the book - it’s public data. It needs to be appealing and be constructed in a way that is optimized for a search engine to work with.

Conversely, the primary purpose of off-page keywords is to directly inform a search engine how to match a book against search queries. The intended audience is a machine, and the data is hidden from consumers - it is "off" the product "page". This private nature gives publishers a lot of freedom to test and experiment.

Here's an official, dry, textbook definition of keywords in publishing:
“Keywords are words or phrases to describe the theme or content of a book. They are assigned by the metadata creator to supplement title, author, description or other consumer facing data.”

While accurate, it leaves out the motivation behind why we use keywords at all.

On the surface, keywords are just a metadata element. But used properly, they can be a powerful discovery mechanism to capture a reader’s experience with a book, in a way that facilitates sharing that experience with others.

Creating effective keywords is an exercise in studying reader psychology and linguistics, requiring empathy and insight into how people communicate about books with each other. If you’re able to think and talk like your audience, you’re more likely to reach them.

Keywords are used to sell all kinds of products online, but creating them is probably toughest for publishers, as books are far more complex and subjectively experienced than other products, like toothbrushes or hair dryers. So figuring out which elements to express can be challenging.

How do search engines use keywords?

Search engines are just computer programs written to find information for us. We type a query, and the engine thumbs through large swathes metadata to decide what books to display.  The richer the metadata, the more search queries the book might match to.

A book with only basic metadata (title and author and so forth) will show up in fewer search results than the same book with 100 or even 50 good keywords. Every keyword you add is an opportunity to widen the search funnel, letting you suggest to the search engine another way consumers can find your book.

Most books are sold online, and most people find books through search (per Amazon). If you can improve a book’s visibility in search, you improve it’s likelihood of selling more copies. A recent study by Recode, a tech news website, found that more shoppers begin their product search on Amazon (55%) than Google (28%).

Tags off-page keywords, metadata, amazon search, books, publishing

A Publisher's Advantage Over Indie Authors

May 23, 2015 Chris Sim

View image | gettyimages.com

When it comes to book discovery and retail search, traditional publishers have two advantages over indie publishers.

More Keywords

The first is the ability to add more keywords to a book. Most independent authors will be able to add 5-7 keywords to their book's metadata. Each keyword (or phrase) provides an opportunity for the book to be matched with more search queries. The more search queries a book matches, the more times it will show up in search results, which of course means an increase in the potential customers who will see the book.

How is this possible?

Each online retailer (such as Amazon), accepts book metadata through different channels, and processes it for the search engine to use. Most independent authors add their data through a website (such as http://kdp.amazon.com). These websites are coded with specific rules about what data can be added and restrict, for example, how many keywords can be entered. (This is necessary to ensure a minimum level of quality in the data, which can impact search results).

Enter ONIX

Publishers, on the other hand, typically send book metadata to retailers in bulk, using an industry file format called ONIX. Under the ONIX standard, the keyword field has no restriction on the number of keywords that can be added to a book.

View image | gettyimages.com

Practically speaking, adding a very large number of keywords will lead to limited discovery benefit, as each online retailer will parse and process a maximum number of keywords.

But retailers almost certainly accept more than 7 keywords. The ONIX standard recommends filling the keyword field with 250 characters. The BISG working group, dedicated to book keyword best practices, however, recommends using 500 characters. This working group comprises members from Amazon and Barnes & Noble, the consumers of this book metadata, who use it in their search engines (we covered this group in our last post). Given this recommendation, and the contributors behind it, it's unlikely the book retailers would restrict keywords to less than 500 characters.

So how does this compare to the 5-7 keywords self-published authors are allowed? 500 characters equates to approximately 80 words, which is at least 27 keywords (phrases of 3 words in length - or more if keywords of 2 words are included). This is almost 4 times as many keywords.

Isn't having too many keywords bad for SEO?

Retail search engines process keywords from metadata in a structured manner (as opposed to web search engines that extract keywords content like web pages), and are unlikely to be subject to keyword dilution, which is the idea that using more keywords reduces an individual keyword's value. Keyword dilution can be a problem for a web page, as the breadth of topics a web page covers is likely to be less than an entire book.

It makes sense to use many keywords to describe the many topics in a book.


Does all this equate to a discovery advantage?

It is highly likely. If you use one keyword, your book will be matched to related search queries about that one topic. If you use 20 or more keywords, your book has 20 opportunities to be matched against different types of search queries, therefore significantly increasing the number of customers likely to see the associated book.

Better Search Data

The second advantage for publishers, who publish on Amazon, is data to help with Amazon SEO. Publishers with a high enough sales volume, will be invited to apply for access to ARA (Amazon Retail Analytics) which provides insight into search queries used by customers on Amazon. Core to any effective SEO strategy is the ability to evaluate and assess different keywords (search terms) for search volume (this is what Google Analytics provides free). ARA provides this data, albeit in a slightly obfuscated value called 'Search Frequency', along with data about conversions. Access to this data improves the efficiency and accuracy of keyword selection, as it allows publishers to determine whether to apply a long or short tail keyword to a book (depending on it's sales rank), and also to assess the type of books that convert the best for each search term.

View image | gettyimages.com

We can speculate why these differences exist, which are likely rooted in the history of traditional publishing and of self-publishing. In the early days of self-publishing, independent authors were much less sophisticated than they are today, as were the service providers to help them publish. It's likely disparities such as these, that provide considerable advantage to traditional publishers, will become less pronounced as the self-publishing industry matures.

Tags publishing, metadata, keywords, amazon, barnes and noble, onix, seo

blog

Mostly AI and Books

Blog
The Emotional Frontier: How AI is Revolutionizing the Way We Express and Understand Sentiment Through the Written Word
about a year ago

Does your book have the right keywords? Buy here