Is there the way of parallel prompt ? #69

DavideHe · 2023-11-21T02:51:45Z

as (run_streaming_llama.py#L61)[https://github.com/mit-han-lab/streaming-llm/blob/main/examples/run_streaming_llama.py#L61] see, prompt must be send to model one by one. that will take the high GPU usage time.
Is there the way of parallel prompt ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is there the way of parallel prompt ? #69

Is there the way of parallel prompt ? #69

DavideHe commented Nov 21, 2023

Is there the way of parallel prompt ? #69

Is there the way of parallel prompt ? #69

Comments

DavideHe commented Nov 21, 2023