StreamingResponse #9
Comments
Yes @franzwilding we have this item on our roadmap, thanks for raising this issue and voicing your preferred solution.
@vblagoje any idea yet when this feature will become available? We are using haystack in quite a few projects now and want to know whether it is worth putting more energy into our workaround solution, or whether we can expect proper streaming out of a pipeline soon :)
Yes, I totally understand! The support is currently being worked on 😎
@vblagoje Any updates regarding an ETA for the feature? Thanks in advance for the heads-up.
@aymbot it's on our immediate roadmap for Q3, starting soon 🙏
With this feature implemented, hayhooks would be a strong alternative to langserve. Thanks again for working on it.
We really need this feature. Is there any recent update? Streaming is very important because most other third-party UIs and packages consume responses in streaming mode.
Any update?
For anyone following this thread: https://dev.to/arya_minus/async-haystack-streaming-over-fastapi-endpoint-2kj0
In order to have a good LLM chat UX, we need to stream the response to the client. Langserve does this with a dedicated endpoint, and hayhooks could do the same (see the sketch below).

Additionally, haystack should provide a special `streaming_callback` that writes each chunk's content to a buffer available to hayhooks. Maybe the Pipeline could add this logic and provide a `pipe.stream` method that returns a generator, or something like that.