{"id":1266,"date":"2023-10-21T12:31:00","date_gmt":"2023-10-21T10:31:00","guid":{"rendered":"https:\/\/icare.ch\/automatic-generation-of-audio-description-and-l2v\/"},"modified":"2026-02-06T16:37:00","modified_gmt":"2026-02-06T15:37:00","slug":"automatic-generation-of-audio-description-and-l2v","status":"publish","type":"post","link":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/","title":{"rendered":"Automatic Generation of Audio Description and Lower Third Voice"},"content":{"rendered":"<p>[et_pb_section][et_pb_row][et_pb_column type=&#8221;4_4&#8243;][et_pb_text]Sub-project 4 (SP4) of the IICT Flagship focuses on the automatic generation of audio descriptions and lower third voices.<\/p>\n<p>Audio description is a technique used to make television programs more accessible to people with visual impairments. It involves a voice-over that describes non-spoken scenes in detail to ensure a better understanding of what is happening on screen. The lower third voice (or &#8216;synth\u00e9&#8217;) is an audio commentary, commonly used to provide complementary information to narrative elements (e.g., reading on-screen text). These operations prove to be very time-consuming for human users.<\/p>\n<p>The Icare research institute is involved in implementing various approaches based on specialized algorithms, artificial intelligence, and machine learning, to automate these tasks. This means automatically extracting visual information, generating a textual description, and storing the result in a structured computer file. This file can then be synthetically vocalized so that the viewer can access the information.<\/p>\n<p>Innosuisse Flagship PFFS-21-47: <a href=\"https:\/\/icare.ch\/en\/inclusive-information-and-communication-technologies-iict\/\">Inclusive Information and Communication Technologies<\/a>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section]<\/p>\n","protected":false},"excerpt":{"rendered":"<p><div class=\"et_pb_section et_pb_section_0 et_section_regular\" >\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t<\/div><div class=\"et_pb_row et_pb_row_0 et_pb_row_empty\">\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t<\/div><div class=\"et_pb_module et_pb_text et_pb_text_0  et_pb_text_align_left et_pb_bg_layout_light\">\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t\n\t\t\t<\/div>Sub-project 4 (SP4) of the IICT Flagship focuses on the automatic generation of audio descriptions and lower third voices. Audio description is a technique used to make television programs more accessible to people with visual impairments. It involves a voice-over that describes non-spoken scenes in detail to ensure a better understanding of what is happening [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":1097,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_et_pb_use_builder":"on","_et_pb_old_content":"","_et_gb_content_width":"","footnotes":"","_links_to":"","_links_to_target":""},"categories":[22],"tags":[],"class_list":["post-1266","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-projects"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.9 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Automatic Generation of Audio Description and Lower Third Voice - Icare<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Automatic Generation of Audio Description and Lower Third Voice - Icare\" \/>\n<meta property=\"og:description\" content=\"Sub-project 4 (SP4) of the IICT Flagship focuses on the automatic generation of audio descriptions and lower third voices. Audio description is a technique used to make television programs more accessible to people with visual impairments. It involves a voice-over that describes non-spoken scenes in detail to ensure a better understanding of what is happening [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/\" \/>\n<meta property=\"og:site_name\" content=\"Icare\" \/>\n<meta property=\"article:published_time\" content=\"2023-10-21T10:31:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-02-06T15:37:00+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.icare.ch\/wp-content\/uploads\/2023\/02\/sp4.png\" \/>\n\t<meta property=\"og:image:width\" content=\"600\" \/>\n\t<meta property=\"og:image:height\" content=\"600\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"icare\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@instituticare\" \/>\n<meta name=\"twitter:site\" content=\"@instituticare\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"icare\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/\"},\"author\":{\"name\":\"icare\",\"@id\":\"https:\/\/www.icare.ch\/en\/#\/schema\/person\/94c8d500f59ac3c9a0bbefaa764c2a35\"},\"headline\":\"Automatic Generation of Audio Description and Lower Third Voice\",\"datePublished\":\"2023-10-21T10:31:00+00:00\",\"dateModified\":\"2026-02-06T15:37:00+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/\"},\"wordCount\":190,\"publisher\":{\"@id\":\"https:\/\/www.icare.ch\/en\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.icare.ch\/wp-content\/uploads\/2023\/02\/sp4.png\",\"articleSection\":[\"Projects\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/\",\"url\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/\",\"name\":\"Automatic Generation of Audio Description and Lower Third Voice - Icare\",\"isPartOf\":{\"@id\":\"https:\/\/www.icare.ch\/en\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.icare.ch\/wp-content\/uploads\/2023\/02\/sp4.png\",\"datePublished\":\"2023-10-21T10:31:00+00:00\",\"dateModified\":\"2026-02-06T15:37:00+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#primaryimage\",\"url\":\"https:\/\/www.icare.ch\/wp-content\/uploads\/2023\/02\/sp4.png\",\"contentUrl\":\"https:\/\/www.icare.ch\/wp-content\/uploads\/2023\/02\/sp4.png\",\"width\":600,\"height\":600},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Accueil\",\"item\":\"https:\/\/www.icare.ch\/en\/home\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Automatic Generation of Audio Description and Lower Third Voice\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.icare.ch\/en\/#website\",\"url\":\"https:\/\/www.icare.ch\/en\/\",\"name\":\"Icare\",\"description\":\"Institut de recherche Icare\",\"publisher\":{\"@id\":\"https:\/\/www.icare.ch\/en\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.icare.ch\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.icare.ch\/en\/#organization\",\"name\":\"Institut de Recherche Icare\",\"url\":\"https:\/\/www.icare.ch\/en\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.icare.ch\/en\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.icare.ch\/wp-content\/uploads\/2024\/02\/icare_logo_favico.png\",\"contentUrl\":\"https:\/\/www.icare.ch\/wp-content\/uploads\/2024\/02\/icare_logo_favico.png\",\"width\":600,\"height\":600,\"caption\":\"Institut de Recherche Icare\"},\"image\":{\"@id\":\"https:\/\/www.icare.ch\/en\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/x.com\/instituticare\",\"https:\/\/ch.linkedin.com\/company\/institut-icare\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.icare.ch\/en\/#\/schema\/person\/94c8d500f59ac3c9a0bbefaa764c2a35\",\"name\":\"icare\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.icare.ch\/en\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/76911ddad9b73cb31b2c36844dce66574be8d94c40922ba6f3a9f67af5388b2a?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/76911ddad9b73cb31b2c36844dce66574be8d94c40922ba6f3a9f67af5388b2a?s=96&d=mm&r=g\",\"caption\":\"icare\"},\"sameAs\":[\"https:\/\/icare.ch\/\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Automatic Generation of Audio Description and Lower Third Voice - Icare","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/","og_locale":"en_US","og_type":"article","og_title":"Automatic Generation of Audio Description and Lower Third Voice - Icare","og_description":"Sub-project 4 (SP4) of the IICT Flagship focuses on the automatic generation of audio descriptions and lower third voices. Audio description is a technique used to make television programs more accessible to people with visual impairments. It involves a voice-over that describes non-spoken scenes in detail to ensure a better understanding of what is happening [&hellip;]","og_url":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/","og_site_name":"Icare","article_published_time":"2023-10-21T10:31:00+00:00","article_modified_time":"2026-02-06T15:37:00+00:00","og_image":[{"width":600,"height":600,"url":"https:\/\/www.icare.ch\/wp-content\/uploads\/2023\/02\/sp4.png","type":"image\/png"}],"author":"icare","twitter_card":"summary_large_image","twitter_creator":"@instituticare","twitter_site":"@instituticare","twitter_misc":{"Written by":"icare","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#article","isPartOf":{"@id":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/"},"author":{"name":"icare","@id":"https:\/\/www.icare.ch\/en\/#\/schema\/person\/94c8d500f59ac3c9a0bbefaa764c2a35"},"headline":"Automatic Generation of Audio Description and Lower Third Voice","datePublished":"2023-10-21T10:31:00+00:00","dateModified":"2026-02-06T15:37:00+00:00","mainEntityOfPage":{"@id":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/"},"wordCount":190,"publisher":{"@id":"https:\/\/www.icare.ch\/en\/#organization"},"image":{"@id":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#primaryimage"},"thumbnailUrl":"https:\/\/www.icare.ch\/wp-content\/uploads\/2023\/02\/sp4.png","articleSection":["Projects"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/","url":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/","name":"Automatic Generation of Audio Description and Lower Third Voice - Icare","isPartOf":{"@id":"https:\/\/www.icare.ch\/en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#primaryimage"},"image":{"@id":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#primaryimage"},"thumbnailUrl":"https:\/\/www.icare.ch\/wp-content\/uploads\/2023\/02\/sp4.png","datePublished":"2023-10-21T10:31:00+00:00","dateModified":"2026-02-06T15:37:00+00:00","breadcrumb":{"@id":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#primaryimage","url":"https:\/\/www.icare.ch\/wp-content\/uploads\/2023\/02\/sp4.png","contentUrl":"https:\/\/www.icare.ch\/wp-content\/uploads\/2023\/02\/sp4.png","width":600,"height":600},{"@type":"BreadcrumbList","@id":"https:\/\/www.icare.ch\/en\/automatic-generation-of-audio-description-and-l2v\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Accueil","item":"https:\/\/www.icare.ch\/en\/home\/"},{"@type":"ListItem","position":2,"name":"Automatic Generation of Audio Description and Lower Third Voice"}]},{"@type":"WebSite","@id":"https:\/\/www.icare.ch\/en\/#website","url":"https:\/\/www.icare.ch\/en\/","name":"Icare","description":"Institut de recherche Icare","publisher":{"@id":"https:\/\/www.icare.ch\/en\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.icare.ch\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.icare.ch\/en\/#organization","name":"Institut de Recherche Icare","url":"https:\/\/www.icare.ch\/en\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.icare.ch\/en\/#\/schema\/logo\/image\/","url":"https:\/\/www.icare.ch\/wp-content\/uploads\/2024\/02\/icare_logo_favico.png","contentUrl":"https:\/\/www.icare.ch\/wp-content\/uploads\/2024\/02\/icare_logo_favico.png","width":600,"height":600,"caption":"Institut de Recherche Icare"},"image":{"@id":"https:\/\/www.icare.ch\/en\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/instituticare","https:\/\/ch.linkedin.com\/company\/institut-icare"]},{"@type":"Person","@id":"https:\/\/www.icare.ch\/en\/#\/schema\/person\/94c8d500f59ac3c9a0bbefaa764c2a35","name":"icare","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.icare.ch\/en\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/76911ddad9b73cb31b2c36844dce66574be8d94c40922ba6f3a9f67af5388b2a?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/76911ddad9b73cb31b2c36844dce66574be8d94c40922ba6f3a9f67af5388b2a?s=96&d=mm&r=g","caption":"icare"},"sameAs":["https:\/\/icare.ch\/"]}]}},"_links":{"self":[{"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/posts\/1266","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/comments?post=1266"}],"version-history":[{"count":6,"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/posts\/1266\/revisions"}],"predecessor-version":[{"id":29784,"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/posts\/1266\/revisions\/29784"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/media\/1097"}],"wp:attachment":[{"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/media?parent=1266"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/categories?post=1266"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.icare.ch\/en\/wp-json\/wp\/v2\/tags?post=1266"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}