jarvis Trying to simulate jarvis kind of system. To-Do Get better understanding on the audio stream code. Integrating the whisper output to the llm.