Voicemail / intercom via Mastodon #447
Replies: 1 comment
-
Sending messages was essentially web based and didn't use the Nabaztag's microphone; this was a different feature. A first step towards what you describe could be a messaging feature over Mastodon with pre-recorded sounds. This also requires setting up a voicemail, i.e. a list of messages that can be seen from a web interface and replayed by pressing the button. This is what #24 is about.

Regarding what you describe (recording a sound on one rabbit and playing it on another), the question is how the feature should be triggered. If it can be triggered by, say, an RFID tag, an ASR voice command or the website, then you can write an application that puts the nabaztag in interactive mode, in which the ASR is effectively disabled and the button can be used as you please. In such a scenario, it would be easy to extend the interactive mode to support this kind of recording.

The other thing to be aware of is that the microphone is of poor quality, so the sound will not be great, especially with the RFID reader and LEDs generating noise.
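To illustrate the interactive mode mentioned above: an application requests it from nabd with a `mode` packet. Here is a minimal sketch; the field names follow my understanding of pynab's nabd protocol, so verify them against PROTOCOL.md before relying on them.

```python
import json

def interactive_mode_packet(request_id: str) -> str:
    """Build a nabd 'mode' packet requesting interactive mode.

    In interactive mode the ASR is effectively disabled and the listed
    events (here just the button) are forwarded to the application.
    Packets are newline-terminated JSON, per my reading of the protocol.
    """
    packet = {
        "type": "mode",
        "request_id": request_id,
        "mode": "interactive",
        "events": ["button"],
    }
    return json.dumps(packet) + "\r\n"
```

The application would send this over the nabd socket, then receive button events and drive the recording itself.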
-
Hi,
I started looking into implementing the voicemail / intercom feature where you can record a message and have it played back on the spouse/paired rabbit. The general idea would be to have a new type of Mastodon message that embeds a media attachment: the recorded message as an MP3.
One challenge is that button holding and recording are currently hard-coded to trigger the automatic speech recognition (ASR). I'm not sure how best to change this.
Ideally, I was thinking of a new boolean field on the nabmastodond config to enable/disable the intercom. If enabled, the ASR should be disabled, but it doesn't feel right for nabd (where the ASR lives) to access the nabmastodond config?
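To make the nabd-side decision concrete, here is a hedged sketch of the routing I have in mind, with a plain dataclass standing in for wherever the flag would actually live (all names here are hypothetical, not existing pynab code):

```python
from dataclasses import dataclass

@dataclass
class RecordingConfig:
    # Hypothetical flag: True keeps today's behavior (run ASR on the
    # recording); False routes the audio to interested services instead.
    asr_enabled: bool = True

def route_recording(config: RecordingConfig, audio: bytes) -> tuple:
    """Return a (destination, payload) pair for a finished button recording."""
    if config.asr_enabled:
        return ("asr", audio)
    return ("recording_event", audio)
```

The open question is which component owns `RecordingConfig`: nabd itself (a global setting) or the service that wants the audio.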
Alternatively, the recording action could be generic and trigger a new type of packet, `recording_event`, which could then be handled by the services. There would need to be a global config flag that enables or disables the ASR: if enabled, keep the current behavior; if disabled, dispatch the recording as an event, which in this case would be handled by nabmastodond. But I feel that might not be easy to "explain" in the web interface (e.g. a toggle between "Enable ASR" and "Disable ASR and send audio to the services" would be confusing).

I'm also not too sure how to pass the audio content down to the services. I saw that the audio used to be stored in a file, `sound_alsa_recording.raw`, but currently that only happens if the logging level is DEBUG. One option would be to always write to that file and have each service read it. However, that would be problematic if two recordings are made in quick succession: the second would overwrite the first while it is still being processed. So another option would be a new file for each event, with a timestamp in the name for example, and the event would contain the filename. That would need a cleanup task, though.

I would be glad to get input on this and on how best to approach/design it. I saw there was an issue #24 about this; maybe @pguyot had an implementation design in mind?
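The one-file-per-recording idea above could be sketched like this, using only the standard library (the naming scheme and cleanup policy are my assumptions):

```python
import os
import time
import uuid

def save_recording(data: bytes, directory: str) -> str:
    """Write one recording to its own uniquely named file and return the path.

    A unique name per event avoids the overwrite problem when two
    recordings happen in quick succession; the event packet would then
    carry this filename.
    """
    filename = f"recording_{time.time_ns()}_{uuid.uuid4().hex[:8]}.raw"
    path = os.path.join(directory, filename)
    with open(path, "wb") as f:
        f.write(data)
    return path

def cleanup_recordings(directory: str, max_age_seconds: float = 3600) -> int:
    """Delete recordings older than max_age_seconds; return how many were removed."""
    removed = 0
    now = time.time()
    for name in os.listdir(directory):
        if not name.startswith("recording_"):
            continue
        path = os.path.join(directory, name)
        if now - os.path.getmtime(path) > max_age_seconds:
            os.remove(path)
            removed += 1
    return removed
```

The cleanup could run periodically, or services could delete each file once they have consumed it.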
Also I don't really know Python and the async code makes it quite complicated for me to follow, so I'm not sure how far I can get. Still, I will make an attempt!
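As a concrete starting point for the `recording_event` idea above, here is a minimal sketch of serializing such a packet and dispatching it on the service side. The packet fields are my assumptions, not an existing nabd packet type:

```python
import json

def make_recording_event(audio_path: str) -> str:
    """Serialize a hypothetical recording_event packet as newline-terminated JSON."""
    return json.dumps({"type": "recording_event", "audio_file": audio_path}) + "\r\n"

def dispatch_packet(line: str, handlers: dict) -> bool:
    """Route a decoded packet to the handler registered for its type.

    Services that don't care about recording_event simply don't register
    a handler, and the packet is ignored (returns False).
    """
    packet = json.loads(line)
    handler = handlers.get(packet.get("type"))
    if handler is None:
        return False
    handler(packet)
    return True
```

In nabmastodond, the handler would read the file named in `audio_file`, encode it to MP3 and attach it to the outgoing Mastodon message.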