Wikispeech
Key Partners
STTS – Södermalms talteknologiservice AB: This company has R&D knowledge and skills in language technology that WMSE does not have in-house. We therefore partnered with them in 2015 and have been developing Wikispeech jointly ever since.
KTH - Royal Institute of Technology: Researchers at the Division of Speech, Music and Hearing (TMH) contribute cutting-edge engineering research expertise to the development of Wikispeech. They have been involved in the project since 2015.
The Swedish Dyslexia Association: As a disability association for people with reading and writing difficulties/dyslexia and for their relatives, parents and other interested parties, the association has a deep understanding of the issues and needs, and a strong network in the area.
Wikimedia Deutschland: The association has a technical advisory role in the project and has actively helped Wikimedia Sverige's developer team with architecture decisions and with navigating the deployment process.
Key Activities
Software development: Developing the Wikispeech extension and supporting tools such as the speech collection application.
Language resources development: In particular speech recordings.
User research and testing: To ensure that it provides value to users, Wikispeech needs to be continually evaluated from a user perspective, with the results of the user testing fed back into the development process.
Value Proposition
A Wikipedia for listeners: The mission of the Wikimedia movement is to make free knowledge available to everyone. One hurdle to fulfilling that mission is that not everyone can read, at all or well, and not everyone prefers reading as a way to learn. Being able to listen to Wikipedia articles is one way to overcome that hurdle, and Wikispeech provides precisely that service, both to the WMF as the developer of the MediaWiki software and to the Wikipedia communities. Close to 300 million people of all ages have a visual impairment to some degree, and almost 40 million are completely blind. Many millions of people have reading difficulties because of dyslexia or are illiterate. We estimate that in total up to 25% of all potential Wikipedia users would prefer listening to articles over reading them.*
Enriching other Wikimedia projects: The language resources crowdsourced through Wikispeech can be used to enrich other Wikimedia projects, such as Wikidata's lexemes or Wiktionary.
Key technology for future projects: The technology developed for Wikispeech and the speech collection application can be reused both for language preservation efforts and as a key piece for collecting oral citations.
Customer Relationships
Software provider to WMF: As the WMF manages the Wikimedia platforms, we have a software provider relationship with it. The WMF controls the software integration process.
TTS solution provider to Wikipedia communities: Even though the WMF delivers Wikispeech to the community via the Wikimedia platforms, Wikimedia Sverige expects to be the organisation responsible (at least initially) for receiving any feedback and implementing it. Wikimedia Sverige will also develop educational material and instructions for how new languages can be added, and would expect (at least initially) to be involved in the addition of any such languages.
Coordinator with open source developers: There are a number of open source projects with overlapping goals and somewhat similar approaches. One example is Mozilla Common Voice, with whom we have had an ongoing exchange during the development of Wikispeech.
Language data provider to reusers: Many researchers, companies and organisations have a vested interest in developing speech solutions but are in need of language data, e.g. language resource and technology research centers, Google and Mozilla. Wikispeech can become an important source of such data.
Customer Segments
WMF: As the provider of the largest and most used MediaWiki installations, the WMF is our main and key customer.
Wikipedia communities: In order to activate the Wikispeech extension on a specific Wikipedia, we need the buy-in of the Wikipedia community concerned.
Researchers and developers of language technologies: The language resources collected as part of the development of Wikispeech should be of interest to academic researchers and other developers of language technologies.
Other MediaWiki users: Any governmental agency in the EU is required to comply with the European Accessibility Act. This also applies to any internally used tools, which we know MediaWiki is often used as.
Other CMS providers: Speechoid (the backend service used to synthesise the speech) can be used by other Content Management Systems. As the community improves Wikispeech by correcting the pronunciation of articles, Speechoid will also be improved.
Key Resources
Know-how in both language technology and MediaWiki development: Our key resource is staff and partners with the competencies and knowledge required to build TTS software for MediaWiki.
Language resources (speech recordings): In order to create new voices (including voices for new languages), the software must have access to large amounts of language resources, primarily speech recordings.
Language resources (lexicon): To improve the quality of Wikispeech in a direct and very wiki-inspired way, edits can be made to the underlying lexicon. Users can contribute missing words or improve the phonetic transcriptions.
The Wikimedia brand: Operating under a Wikimedia brand gives us credibility with funders and partners.
The Wikimedia volunteers: The hardworking community of volunteers improves the data, allowing the Wikispeech solution to become better and better.
Channels
Swedish Wikipedia (where the extension is available): On Swedish Wikipedia, a logged-in user will initially have the option of activating Wikispeech through the Beta Feature mechanism. New beta features are communicated through newsletters, and users have frequently chosen to opt in to new features.
MediaWiki site: The MediaWiki site, https://www.mediawiki.org/wiki/Extension:Wikispeech, is where the extension is made available to developers and system administrators.
Meta site: The Meta site, https://meta.wikimedia.org/wiki/Wikispeech, is where Wikispeech is presented to the Wikimedia community.
CLARIN (and other research data platforms): The recorded speech data will be made available to Språkbanken Tal, https://sprakbanken.speech.kth.se/, the national speech data centre of Sweden. Språkbanken Tal in turn makes its resources available on the equivalent European platforms, such as CLARIN and the European Language Grid.
ML/AI resource sites: There are a number of websites where datasets suitable for use in ML/AI applications can be published, e.g. Kaggle. We should evaluate them and publish our language resources on at least one, and then evaluate them again based on the use and reach we see.
Cost Structure
Development team at WMSE: WMSE needs dedicated software developers in order to develop Wikispeech and integrate it into MediaWiki. This also includes supporting tools for collecting language resources, a key resource in powering Wikispeech.
Technical infrastructure and integration support at WMF: The development of Wikispeech uses technical environments and services provided by the WMF.
Revenue Streams
Specific grants for Wikispeech: To further improve Wikispeech, funding can be sought through strategic grant applications.
WMSE Donations (share of): A share of the direct donations to WMSE will be assigned to basic maintenance (bug fixes, minor technical upgrades) of the Wikispeech MediaWiki extension and backend service.
APG grants (share of): A share of the annual APG grant that WMSE receives from the WMF will be assigned to basic maintenance (bug fixes, technical upgrades), minor developments (feature tweaks and additions, performance and stability improvements, smaller improvements to the underlying language models) and responding to user feedback.