{"id":2250,"date":"2026-01-10T10:04:47","date_gmt":"2026-01-10T10:04:47","guid":{"rendered":"https:\/\/society.europeanschoolradio.eu\/?p=2250"},"modified":"2026-01-10T15:41:40","modified_gmt":"2026-01-10T15:41:40","slug":"mdpiarticle012026","status":"publish","type":"post","link":"https:\/\/society.europeanschoolradio.eu\/en\/2026\/01\/10\/mdpiarticle012026\/","title":{"rendered":"Transforming Podcasts into Structured Knowledge with Artificial Intelligence"},"content":{"rendered":"<p><\/p>\n<p data-start=\"112\" data-end=\"376\"><a href=\"https:\/\/society.europeanschoolradio.eu\/wp-content\/uploads\/2026\/01\/multimedia-02-00001-ag.webp\"><img fetchpriority=\"high\" decoding=\"async\" src=\"https:\/\/society.europeanschoolradio.eu\/wp-content\/uploads\/2026\/01\/multimedia-02-00001-ag-1024x651.webp\" alt=\"\" width=\"750\" height=\"477\" class=\"aligncenter size-large wp-image-2251\" srcset=\"https:\/\/society.europeanschoolradio.eu\/wp-content\/uploads\/2026\/01\/multimedia-02-00001-ag-1024x651.webp 1024w, https:\/\/society.europeanschoolradio.eu\/wp-content\/uploads\/2026\/01\/multimedia-02-00001-ag-300x191.webp 300w, https:\/\/society.europeanschoolradio.eu\/wp-content\/uploads\/2026\/01\/multimedia-02-00001-ag-768x488.webp 768w, https:\/\/society.europeanschoolradio.eu\/wp-content\/uploads\/2026\/01\/multimedia-02-00001-ag.webp 1109w\" sizes=\"(max-width: 750px) 100vw, 750px\" \/><\/a>Podcasts are today one of the fastest-growing forms of digital content. 
Despite the richness of information they offer, their large-scale exploitation remains challenging, as they consist of unstructured audio that cannot be easily searched, analyzed, or filtered.<\/p>\n<p data-start=\"378\" data-end=\"632\">As part of our recent research and technological work, we developed a <strong data-start=\"448\" data-end=\"474\">fully automated system<\/strong> that transforms podcasts into <strong data-start=\"505\" data-end=\"565\">structured, analyzable, and recommendation-ready content<\/strong>, leveraging state-of-the-art Artificial Intelligence technologies.<\/p>\n<h5 data-start=\"639\" data-end=\"675\">From audio to written information<\/h5>\n<p data-start=\"677\" data-end=\"739\">Our system is built on an end-to-end processing pipeline that:<\/p>\n<ul data-start=\"741\" data-end=\"965\">\n<li data-start=\"741\" data-end=\"807\">\n<p data-start=\"743\" data-end=\"807\">converts audio into text through automatic speech recognition,<\/p>\n<\/li>\n<li data-start=\"808\" data-end=\"842\">\n<p data-start=\"810\" data-end=\"842\">processes and cleans the data,<\/p>\n<\/li>\n<li data-start=\"843\" data-end=\"925\">\n<p data-start=\"845\" data-end=\"925\">analyzes the text using NLP techniques to extract topics and key concepts, and<\/p>\n<\/li>\n<li data-start=\"926\" data-end=\"965\">\n<p data-start=\"928\" data-end=\"965\">recommends relevant content to users.<\/p>\n<\/li>\n<\/ul>\n<p data-start=\"967\" data-end=\"1108\">The result is a collection of podcasts that can now be searched and organized based on their actual meaning, rather than just titles or tags.<\/p>\n<h5 data-start=\"1115\" data-end=\"1164\">Artificial Intelligence with real-world impact<\/h5>\n<p data-start=\"1166\" data-end=\"1473\">This project is a clear example of how the combination of <strong data-start=\"1224\" data-end=\"1271\">Data Engineering, Machine Learning, and NLP<\/strong> can deliver meaningful solutions to real-world problems. 
Rather than relying on isolated models, we designed an architecture that operates at scale and can support production environments.<\/p>\n<p data-start=\"1475\" data-end=\"1688\">For organizations that manage large volumes of audio or multimedia content, solutions of this kind enable improved content discovery, a better user experience, and new opportunities for data-driven value creation.<\/p>\n<h5 data-start=\"1695\" data-end=\"1735\">A collaboration with tangible results<\/h5>\n<p data-start=\"1737\" data-end=\"2194\">The system was developed through close collaboration between a research team from the <strong data-start=\"1844\" data-end=\"1881\">International Hellenic University<\/strong> and <strong data-start=\"1886\" data-end=\"1911\">European School Radio<\/strong>, within the framework of the European <strong data-start=\"1950\" data-end=\"1971\">Kids Radio Europe<\/strong> project, combining expertise in data analytics, artificial intelligence, and distributed systems. The outcome is not merely a research study, but a functional technological solution with clear practical and business value. 
The technology developed by our team is already in production on <a href=\"https:\/\/europeanschoolradio.eu\/\">europeanschoolradio.eu<\/a> and <a href=\"https:\/\/youthradio.eu\/\">youthradio.eu<\/a>.<\/p>\n<p data-start=\"2201\" data-end=\"2393\" data-is-last-node=\"\" data-is-only-node=\"\">For those interested in the technical details of the approach, including the methodology and system architecture, the full publication is available here:<br data-start=\"2354\" data-end=\"2357\" \/><a data-start=\"2357\" data-end=\"2393\" data-is-last-node=\"\" rel=\"noopener\" target=\"_new\" class=\"decorated-link\" href=\"https:\/\/www.mdpi.com\/3042-6308\/2\/1\/1\">https:\/\/www.mdpi.com\/3042-6308\/2\/1\/1<\/a><\/p>\n<p><\/p>","protected":false},"excerpt":{"rendered":"<p>Podcasts are today one of the fastest-growing forms of digital content. Despite the richness of information they offer, their large-scale exploitation remains challenging, as they consist of unstructured audio that cannot be easily searched, analyzed, or filtered. 
As part of our recent research and technological work, we developed a fully [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":2251,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[50],"tags":[],"class_list":["post-2250","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-50"],"acf":[],"_links":{"self":[{"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/posts\/2250","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/comments?post=2250"}],"version-history":[{"count":2,"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/posts\/2250\/revisions"}],"predecessor-version":[{"id":2253,"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/posts\/2250\/revisions\/2253"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/media\/2251"}],"wp:attachment":[{"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/media?parent=2250"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/categories?post=2250"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/society.europeanschoolradio.eu\/en\/wp-json\/wp\/v2\/tags?post=2250"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}