How Metadata Enhances Content Discovery

Spherex • Feb 25, 2022

Media companies spend a lot of time and money studying and modeling consumer behaviors. It’s big business and a critical component of today’s media marketplace. Entire companies, platforms with specialized engineering teams, academic researchers, entrepreneurs, and the public attempt to find the Holy Grail of search algorithms that provide the best way to recommend titles, so you don’t change the channel.


Algorithms are components of computer programs that analyze data to identify market and user trends, track inventories, improve network traffic efficiencies, and provide content recommendations to new or long-time system users. Artificial intelligence and machine learning are increasingly incorporated to augment and add insights into data analysis that would take years to build into the datasets these programs use. But when it comes to recommending what program to watch, one type of information is so important that without it, algorithms would fail miserably: metadata.


We’ve written about metadata before, so there’s no need to revisit all of that here. In this post, we will focus on how metadata contributes to the effectiveness of search algorithms and why getting it right can lead to increased revenue.

 

Metadata Feeds Search Algorithms


Over 1.7 billion households are now searching for a movie or TV show to watch over the air, stream, buy or rent. That number is expected to increase to 1.8B by 2026. At some point, whoever controls the remote has to think about what to watch. If they’re searching on a VOD or OTT platform, they’re entering titles, names of actors, genre type, age-rating, words, or phrases that describe what they’d like to see. If the customer has been on the platform for a while, the service has kept track of what they’ve watched and searched their catalogs for titles with similar styles of content.

 

Think about it this way: if you’re searching for a title to watch with young children, your search is likely going to include a “G” (or comparable) age rating. That’s metadata. If you’re looking for a romantic movie shot in Ireland, both the genre and the location are metadata. If your favorite musician is Andy Gibb, and you want to find which movies or TV shows he appeared in, guess what? You’re going to find out using metadata.


That’s the high-level stuff. Studios, distributors and platforms utilize metadata that consumers don’t even think about when building their search engines and algorithms. Here are a few examples of those kinds of metadata:


Synopsis Standalone or episodic
Season number Number of seasons
Season episode Series title
Work type Category
Season synopsis Official language
Supported languages Release date
Director Key actors
Country of origin Rent or buy
Available markets Closed captions
Language of metadata Country of metadata

Depending upon the platform, the number of metadata fields varies. Some platforms may request more descriptive details on character traits, such as whether the lead is “kind” or “obnoxious.” Is the story originally written for the screen, or is it based on a novel or actual events? Does the film have a strong female lead, or is it a film about a group of friends? These data add context to the film record and enhances search, classification and matching capabilities.


Additional subscriber and profile details are drawn upon to further define possible interests. For example, is the subscriber male or female? What is their income level? Where do they live? Are there children or senior citizens in the home? Which movies do critics or people near them recommend? These are key factors in personalization.


There can be thousands of bits of information used to suggest something for you to watch. For example, Netflix has 222M subscribers, each having dozens of data points about their content preferences and watch history. The amount of data processed for each search means not only must the programs and network systems themselves be extremely robust, the algorithms doing the work are very complex.


Figure 1 is a search algorithm. This one won the Netflix Prize, which the company crowdsourced to see if their search model could be improved by more than 10%. The winning team was awarded $1 Million. You can find details about their formula here.

Why Content Creators and Platforms Take Metadata Seriously



The theatrical, linear, streaming, online, or retail video content market is enormous. According to IMDb Pro, over 235,000 TV and movie titles are available in the US alone, and over 5.7 million available worldwide. The question for content creators and distributors is how will you stand out in such a crowded market? How will you get to the top of the search results? Can you even get noticed?   


Notwithstanding whether the title is any good or not, metadata alone isn’t going to get it near the top of the search results page, but it can help. Providing as much of it as the platform or store requests and developing it is a good investment of time and resources. Search algorithms do not care whether your title has data for each of the fields it utilizes, but you can be sure that if nothing is in the key fields, your title may be harder for consumers to find. The worst films still have metadata that describe them, especially if they’ve won awards for being bad. You may not have watched them, but most movie junkies have heard of “Plan 9 from Outer Space” or “The Room.” When you search for them, you’re going to find them and see detailed information about the title, its plot, actors and director, and why people think it’s so bad that it’s worth watching.


Obtaining high-quality and effective metadata is not a task left to the uninitiated. Studios like Disney and platforms like Netflix and Amazon Prime Video have teams of employees or contractors whose job is to watch and annotate metadata for their original movie and TV titles. Companies who distribute titles via TVOD or retail stores know better metadata makes their titles more easily findable by consumers and thereby more marketable. That means more sales, more rentals, more views and more revenue.


Global listings and metadata is one of Spherex’s core businesses and provides studios or platforms easy access to a massive data store of over 1 million unique titles, including artwork variations and trailers in 45 languages spanning over 140 locales. Covering many languages, versions, and formats, the Spherex datastore contains nearly 25 million title records for Hollywood's top movies and tv shows and titles produced worldwide. Title records are cleansed, normalized, localized and ready to use.


It's easy to dismiss or not be too concerned about a title’s metadata quality because it’s not something people see. But whether they realize it or not, it is something they use every day. With nearly 600,000 new titles released worldwide each year, competition for the top placement on results pages is only going to get more intense. Understanding the importance of metadata, taking advantage of its proper use can help get your content the audience attention it needs to positively impact your bottom line.

Share this post

Share this post

Related Posts

By Spherex 16 Apr, 2024
"The only way you can predict the future is to build it." - Alan Kay
By Spherex 08 Apr, 2024
Technology continues to drive change and opportunity.
By Spherex 27 Mar, 2024
Santa Clara, CA (March 27, 2024 ) — Spherex , a global technology and data company serving the Media and Entertainment industry, will showcase Spherex AI ™, its pioneering AI platform for video content compliance and analysis, at NAB 2024 in Las Vegas from April 13 to 17. The Spherex AI ™ platform empowers content owners to adapt their narratives for the broadest international audience while avoiding regulatory restrictions and censorship barriers. Spherex will be exhibiting at Booth W2217 in the West Hall of the Las Vegas Convention Center. Spherex AI ™ drastically reduces the time and resources required for video content localization, mitigates the risk of cultural insensitivity or noncompliance, and enables creators to accelerate expansion and reach the right audience faster, cost-effectively, and without brand risk. The platform scans film, TV, or advertising content quickly and affordably, making it the ideal solution for verifying the suitability of any video content for any global market. “Today, content providers and streaming platforms must navigate complex regulations and cultural norms to operate successfully in different markets,” said Teresa Phillips, Co-founder and CEO of Spherex. “By seamlessly integrating sophisticated AI with cultural and regulatory insights, Spherex helps content creators, studios, distributors, and platforms unlock new markets, broaden their audiences, and increase revenue.” At the heart of Spherex AI ’s™ innovation is its proprietary AI technology, based on a decade of meticulous research and analysis of laws, regulations, and film/TV classifications in over 100 countries, as well as the screening of millions of film and TV clips for training and validation. Unlike conventional methods that employ AI and Machine Learning (ML) to identify objectionable content in a scene, Spherex uses advanced techniques to interpret the cultural context surrounding classifiable elements such as drugs, violence, sexuality, and profanity. Understanding the context is crucial because it's not just about the events happening in a video or who is involved; it's about comprehending the setting, characters, impact, and consequences. These factors are essential in determining local age ratings and cultural suitability for worldwide markets. To book a meeting with Spherex at NAB, contact Davinder Lu thra here . About Spherex Spherex is a global technology and data company transforming how media and entertainment enterprises create, adapt, and deliver film and television to audiences worldwide through AI and machine learning. With unmatched expertise in culture and regulatory compliance, Spherex works with the world's largest media companies, movie studios, networks, distributors, and streamers to build larger audiences, speed up content discovery, drive more video views, and generate higher revenue. Learn more at www.spherex.com .
05 Mar, 2024
Three Criteria to Consider When Adding Generative AI to Your Work
By Spherex 15 Feb, 2024
With the global nature of digital platforms requiring content localization, Artificial Intelligence (AI) emerges as a key tool to address these challenges by efficiently scanning, analyzing, and moderating content, thereby ensuring a safer and more culturally respectful digital environment.
By Spherex 30 Jan, 2024
Examine the tension between creative freedom and compliance in Malaysia's film industry, through the legal challenges faced by the creator "Mentega Terbang" for allegedly violating laws protecting religious sentiments.
By Spherex 12 Dec, 2023
Understand significant differences between film and TV ratings, emphasizing the complexities content creators face in preparing titles for television due to regulatory and market concerns such as commercial breaks, run-time, and cultural variations.
compliance metrics being evaluated on screen
By Spherex 20 Nov, 2023
Explore the evolution of ad placement in the dynamic landscape of streaming, D2C platforms, and linear TV. From the simplicity of targeting demographics to today's complex considerations involving regulated content, discover how advertisers navigate challenges.
By Spherex 11 Oct, 2023
There is no immunity from Censorship.
Show More
Share by: