Inspired by the popularity of knowledge graphs, VRT is interested in using the technology on top of their media archives to enable smart content search and exploration by archivists, journalists and program creators.
AT A GLANCE
Siloed media archives are not easy to explore for business users
Current search results don't bring up valuable results
Archive experts are needed to get value out of the various media libraries
We showed the possibility that journalists & program creators can explore media across archives themselves instead of going through an experienced archive expert
Content is more reusable as it's easier to find
In the future archive experts can improve the quality of the media libraries in a more targeted manner
Currently locked potential includes more context aware recommendations for content can be provided
CONTEXT & OBJECTIVES VRT's media library contains millions of items and a variety of media types such as video, news articles, voice recordings, and images. This content is spread over multiple database systems, each with its own unique way of storing metadata and substance. These libraries are maintained and expanded by experienced archivist teams but don't have a single business-friendly search and exploration function across systems. SOLUTION
Dotdash builds as a proof-of-concept a knowledge graph layer on top of various media libraries that connects different metadata types and content. This graph ingests and relates content by topics, entities, locations, contributors, and program structure. Natural language processing is used to extract important named entities(persons, organisations and locations) from content.Wikidata is used as external source to enrich and relate the named entities. Content and related topics are recommended by using the structure and context from the knowledge graph. BENEFITS Improved data exploration We showed the possibility that after a google-like search, a visual representation of connected content gives an easy way to explore new and interesting content from the same or related topics.
Reduced support from archive experts In the future business users can explore media more autonomously, freeing up time for archive experts to further improve the data governance of media archives. Context aware search & recommendations Currently locked potential includes that more relevant content can be found and recommended for a given search term and content item. A search or recommendation for "Calatrava" could now include the train station of Luik, when that information was originally not in the data. Furthermore, search results can filtered more efficiently based on context aware facets.
Download our whitepaper on creating business value with smart data