Package that supplies a number of Microsoft services and implements the TTSService and TextTranslationService protocols.
The package includes an implementation of the Microsoft Text Transalator API.
Get aquainted with the api here: https://docs.microsoft.com/en-us/azure/cognitive-services/translator
To get started you need to create a translator resource in azure and obtain an acecss key and region of the service.
After that you're ready to get started. Begin by setting up your translator
let translator = MSTextTranslator(config: .init(key:"your key", region:"your service region"))
There's a number of translation methods available depending on your needs. You can supply an array of strings, a dictionary of keys and values or a single string. You can also translate multiple langauges at one time.
let texts = ["my_string": "My untranslated text"]
do {
let table = try await translator.translateAsync(texts, from: "en", to: ["sv","fr"])
print(table.value(forKey: "my_string", in: "sv"))
} catch {
debugPrint(error)
}
let texts = ["my_string": "My untranslated text"]
translator.translate(texts, from: "en", to: ["sv","fr"]).sink { [weak self] compl in
if case let .failure(error) = compl {
debugPrint(error)
}
} receiveValue: { [weak self] table in
print(table.value(forKey: "my_string", in: "sv"))
}.store(in: &cancellables)
The TextTranslationTable is a structure used to store and return translated values. The structure can be used to reduce the number of calls made to the backend by ensuring that a value isn't translated more than once. Once one or more values has been translated you can store the returned table for later use. You don't have to check if the table contains a string before you translate, the framework will do that for you.
class Translator : ObservableObject {
let service = MSTextTranslator(config: .init(key:"your key", region:"your service region"))
var translations = TextTranslationTable()
init () {
service.logger.publisher.sink { e in
print(e.description)
}
}
func translate(_ texts:[String:String]) {
do {
translations = try await service.translateAsync(texts, from: "en", to: ["sv","fr"], storeIn: translations)
} catch {
debugPrint(error)
}
}
func string(for key:String, in language:String) -> String? {
return translations.value(forKey: key, in: language)
}
}
If you're creating an app and you're making use of native localization, you might want to take a look at the Dragoman package and how the MSTextTranslator can be used to automatically write and read translated texts to strings files.
More information can be found at https://github.com/helsingborg-stad/spm-dragoman
There are a number of limitations to the api depending on your subscription level. The framework them all into account with one exception and that's number of requests per minute. You can read more about the limitations here: https://docs.microsoft.com/en-us/azure/cognitive-services/translator/request-limits
MSTTS
is a concrete implementation of the TTService
protocol. In order for the library to function you need to include the AudioSwitchBoard
package: https://github.com/helsingborg-stad/spm-audio-switchboard
let switchboard = AudioSwitchBoard()
let tts = MSTTS(config: .init(key:"your key", region:"service region"), audioSwitchboard: switchboard)
// Add phonemes or an other replacment for specific words or phrases
tts.pronunciations = [
.init(string: "some word", replacement: "<phoneme alphabet='ipa' ph='PHONEME'>some word</phoneme>")
]
// Create an utterance. With the utterance you can also follow status of that specific utterance using it's status property
let u = TTSUtterance("Testing the microsoft text to speech library", locale: Locale(identifier:"en-US"))
// Wait for the service to be available, it need to grab all available voices before it continues.
tts.$available.sink { a in
guard a else { return}
// start speaking
tts.start(utterance: u)
}.store(in:&cancellables)
While the instance can be used as is, it's best used in combination with TTS
package. The TTS
package delivers a queue and an abstraction layer to the service. It can also be used to manage more than one service at the time and chooses the best available service to use.
More information about how to set this up can be found in the TTS service documentation: https://github.com/helsingborg-stad/spm-tts
This section is directed to the developer of this package
Microsoft has yet to release it's own SPM package. Therefor this package includes a binary target declaration of their static framework.
In case the framework is updated the developer of this repo must make sure to add the new zip-reference and change the checksum.
Once you've obtained the zip url(have a look at the current url and just change the version number, it should work) you need to add it to the url to the Package.swift
file.
Once that's done you need to download the zip file to your computer and run shasum -a 256 [FILENAME].zip
. The result of the command should go into the checksum
parameter of the Package.swift
file
- MSTTS and all related classes is due for a refactoring (with concurrency in mind)
- list available services
- add download support (ie offline support) to the MSTTS
- finish the speech recognizer implementation using the STTService protocol
- create a STTTranslationService protocol similar to the SSTService protocol and implement it to the MSSpeechTranslator
- code-documentation
- write tests
- complete package documentation