BBC Sounds launches trial of generative AI-powered subtitles

29 August 2024, 17:44

BBC Stock. Picture: PA

A range of audio programmes will have transcripts produced using an artificial intelligence tool as part of a three-month trial of the technology.

The BBC Sounds app has launched a trial using generative AI to generate subtitles and transcripts for a range of audio programmes on the platform.

The broadcaster said it was using a speech-to-text AI tool called Whisper, created and made available open source by ChatGPT creator OpenAI, to power the new feature, which will be publicly trialled on the web and Android versions of the BBC Sounds app, with support on Apple’s iOS to follow in the coming weeks.

It said the tool was used to “quickly generate a high-quality transcript” of programme audio, which was then reviewed and edited where necessary by the BBC’s editorial team before a final transcript was uploaded with the audio to the BBC Sounds app.

Aniruddh Dimri, head of product at BBC Sounds, said: “A crucial part of the BBC’s mission is that everyone across the UK feels the BBC is for them. Sometimes that’s about the content reflecting our different backgrounds, interests, and identity but it’s also about ensuring everyone can access our content.

“As an example, BBC Sounds currently produces approximately 27,000 hours of content per month – but much of it can be difficult to access for the approximately 18 million people in the UK who are deaf, have hearing loss or tinnitus in the UK. We have been exploring ways to add subtitles so people can follow the audio with the help of text.

“Doing this manually would be time-consuming and prohibitively expensive.

“However, as we pilot new technology and explore how we can work with and use Generative AI tools to benefit our audiences, we have been looking at whether AI can help us add high quality, accurate subtitles to our audio content.”

For the trial, the AI-powered transcripts will initially be used on programmes including In Touch, Access All, Profile, Sporting Witness and Economics with Subtitles, with the BBC looking to use the technology on further programmes throughout the three-month trial, it said.

“After three months we’ll review the progress made in the trial, how well the tools are working and if they’re a cost-effective way of making transcripts available in Sounds,” Mr Dimri said.

“After that review, we’ll determine whether or not to continue, and if successful whether to roll them out to more of our content on BBC Sounds, and potentially to expand to our archive as well.”

Generative AI tools have become increasingly common over the last 18 months, since the release of ChatGPT, as tech firms and other businesses have looked to take advantage of the frenzy around the technology and provide new tools to draw more consumers to their products and services.

By Press Association
