Posts

DW uses news.bridge prototype for automated subtitling in day-to-day news operations

,

nb_lang_sub

Right from the start, project news.bridge has been about creating a language technology platform fit for daily production workflows at modern media organizations. Our goal is to provide a collection of transcription, translation, subtitling and voice-over services that reporters and editors are able to use hassle-free. In the last couple of months, we’ve taken another big step towards this goal: DW newsrooms have started to work with the news.bridge prototype.

Kudos for pioneering spirit go to both the Hindi and Portuguese language teams, who were the first to use our (still unfinished) platform for the automated production of subtitles for program items on an actual publishing schedule. Needless to say, there were a couple of hiccups, but all in all the subtitling process went really well, and the results speak for themselves.

DW Hindi used news.bridge to turn English audio into Hindi subtitles in a short web video on activists fighting against female genital mutilation in Guinea:

https://twitter.com/dw_hindi/status/1068876286259531778

nb_dw_hindi

DW Portuguese for Africa worked with news.bridge to create Portuguese subtitles from German audio for an episode of Euromaxx’s “Baking Bread”. The video focuses on pão de milho, Portugal’s famous cornbread:

https://www.dw.com/pt-002/baking-bread-o-que-nos-ensina-a-broa-de-milho-sobre-portugal/av-48283900

nb_dw_brasil

By now, a team of editors from different newsroom has started adapting long-form videos from German and English. A series of documentaries on celebrated art school Bauhaus will soon be published with Russian and Brasilian Portuguese subtitles, for instance. The Turkish, Indonesian and Swahili desks have also experimented with news.bridge.

Just a couple of days ago, the DW Brasil department announced they’re preparing the production of subtitles for no less than 12 (German-language) instalments of DW’s Reporter series. We think news.bridge will be just right for the heavy lifting in the translation process.

Meet us in London and Bonn

Our colleagues at the BBC are hosting two very interesting hackathons focused on human language technology in (news) media production, and we’re happy to spread the word:

textAV will take place in London on September 18th and 19th. It caters to “technologists, application developers, and practitioners working in the area of online audio and video, with a particular focus on the use of captions and transcripts to facilitate and speed up the production process”. For more information and a schedule, check out the textAV eventbrite page. Please note that web registration has already ended. A few tickets may still be available via textav@bbcnewslabs.co.uk.

Summa #newsHACK (Pt. II), co-hosted by news.bridge partner Deutsche Welle, will take place in Bonn on October 9th and 10h. The central question of this event will be: “How could cutting-edge language processing technology transform your newsroom?” Developers, designers, and innovation managers will have access to the powerful Summa platform, which offers services for automated translation, entity extraction, topic detection, summarisation, and story clustering. Check out the Summa #newsHACK eventbrite page to learn more.

Several members of the news.bridge consortium will join the HLT design sprints in London and Bonn, so make sure to say hi if you’re also attending. Get in touch any time via info@newsbridge.eu or @newsbridge_hlt. We’re looking forward to seeing you at textAV and Summa #newsHACK!

5 questions with… Peggy van der Kreeft

Peggy van der Kreeft discussing DW’s human language technology projects at Global Media Forum 2017

 

The news.bridge consortium consists of four partners: Deutsche Welle (DW), the Latvian News Agency (LETA), the Laboratoire d’Informatique de l’Université du Mans (LIUM), and Priberam. We are all part of the media and tech industry in some way, we’re all fascinated by what cutting-edge human language technology (HLT) can do, and we’re all dedicated to this DNI project. Other than that, we’re actually pretty different organizations and people with a wide range of interests and a distinctive set of skills. So we figured it would be a good idea to sit down, ask the four team leaders a couple of questions — and give you some more insights on who is doing what exactly, and why it’s all worthwile.

For the first of part of this series of posts, we’ve talked to Peggy van der Kreeft, an experienced linguist and innovation manager, who is running news.bridge at DW.

Peggy, when and how did you first get in touch with human language technology (HLT)?

That was is in the early 1980s, believe it or not, during my postgraduate on “Translating at European Level” at the University of Louvain. I tried out early machine translation (MT) systems like SYSTRAN. Later on, in the 90s, I used Babel Fish while working at an American translation and documentation center. The first research project at Deutsche Welle that focused on MT was CoSyne, which ran from 2010 to 2013. Among other things, we experimented with translating DW’s Today in History. The quality wasn’t good enough for direct publication, but there was always post editing, and we already succeeded in making the translation process a little more efficient. My first experience with automated speech recognition (ASR) and speech-to-text (STT) was in the scope of project AXES (2011-2015). This one was about finding novel ways to explore and interact with audiovisual libraries. The platform itself wasn’t sophisticated enough for use in a real production environment, but it certainly showed the power of HLT — which has been a focus topic of our department (DW Research & Cooperation Projects) ever since.

What is the most fascinating aspect about news.bridge?

There are many fascinating aspects, so it’s really hard to choose one. Perhaps the most striking thing is that the platform is so powerful, even though it’s based on a fairly simple concept. news.bridge covers virtually any language, and through the use of external tools, it remains state of the art.

What is the project’s biggest challenge?

Well, the overall challenge is to make news.bridge stable, scalable and provide a smooth workflow for the entire process of creating subtitles (and voice-overs). A challenge we’re currently focusing on is the seamless ingestion of existing scripts — which are rarely standardized. However, using original scripts (instead of ASR/STT) always leads to the best results, so we need to work on this.

Who’s in your team and what are they currently working on?

Here at DW, we’re currently four people: Ruben Bouwmeester, Hina Imran, Alexander Plaum, and myself. Ruben, who joined the project very recently, works on the project’s business development and marketing plan. He also makes sure we always have professionally designed dissemination material. Hina is our developer. She works on customized user interfaces for HLT applications and output, coordinating technical issues with LETA and within DW. She also manages and maintains a local test installation of news.bridge. Alex is in charge of communication and dissemination. He runs our website, this blog, and our Twitter account. He also works on brand design and sometimes acquires new partners. As for me, I am the HLT lead at our department and also the main coordinator of news.bridge. That means I take care of operations, oversee and report progress, organize user testing and plan implementation.

Where do you see news.bridge in five years?

news.bridge has already attracted quite a bit of attention. Many broadcasters, news producers and language technology providers are interested in implementing it as soon as possible. Multilingual content has become very important, and news.bridge will significantly speed up production workflows — with modest investment costs. We hope we’ll be able to offer a reliable service — local installations and software as a service (SaaS) — sometime in 2019. But we are not waiting for that. Our first major test case is here at Deutsche Welle, with its many newsrooms and its international orientation. We’ve started betatesting the platform for automated translation and subtitling of videos in different languages, and we’re taking if from there. It would be great if news.bridge became a standard HLT platform by 2023; it certainly has the potential. In order for that to happen, we need to find the right exploitation partners and strategies, of course. news.bridge is not a startup, but a media innovation project. When it’s finished, the platform will lead its own life.