microsoft / onnxruntime Public

Notifications You must be signed in to change notification settings
Fork 2.9k
Star 14.8k

Code
Issues 2.4k
Pull requests 513
Discussions
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: microsoft/onnxruntime

Labels 65 Milestones 2

New pull request New

513 Open 14,802 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Implementation of flash attention for native webgpu ep

#22932 opened Nov 24, 2024 by sushraja-msft

Loading…

Update index.md

#22929 opened Nov 22, 2024 by parinitarahi

Loading…

Bump onnx from 1.16.1 to 1.17.0 in /onnxruntime/python/tools/transformers/models/phi2 dependencies

Pull requests that update a dependency file

python

Pull requests that update Python code

#22928 opened Nov 22, 2024 by dependabot bot

Loading…

Update pipeline status

#22924 opened Nov 22, 2024 by tianleiwu

Loading…

Cjian/java gradle

#22923 opened Nov 21, 2024 by jchen351 • Draft

[TensorRT EP] Use TRT/CUDA/ORT version from runtime instead of build time to generate hash value

#22921 opened Nov 21, 2024 by chilo-ms

Loading…

[VSINPU]Split\Pad and some element-wise OPs support

#22916 opened Nov 21, 2024 by xuke537

Loading…

[js/webgpu] support FlashAttention-2 for attention operator ep:WebGPU

ort-web webgpu provider

#22915 opened Nov 21, 2024 by xhcao

Loading…

MlasTranspose multi-threads support.

#22912 opened Nov 21, 2024 by msy-kato

Loading…

[QNN EP] [DRAFT] Support Conv float weight/bias.

#22906 opened Nov 20, 2024 by adrianlizarraga • Draft

Update Onnxruntime download version for GenAI

#22900 opened Nov 20, 2024 by ajindal1

Loading…

Override android qnn sdk version with pipeline param

#22895 opened Nov 19, 2024 by sheetalarkadam

Loading…

Update Intel Thread Counts

#22894 opened Nov 19, 2024 by A-Satti

Loading…

[TensorRT EP] Add new provider option to exclude specific ops from running on TRT

#22892 opened Nov 19, 2024 by chilo-ms • Draft

#22890 Fix profiling on empty Optional

#22891 opened Nov 19, 2024 by amancini-N

Loading…

Quantize Bias for Conv/Gemm on Quantized Model

#22889 opened Nov 19, 2024 by centwang • Draft

Add Optional Activation node to NodeUnit

#22888 opened Nov 19, 2024 by centwang • Draft

[js/webgpu] Enable graph capture with memcpy

#22883 opened Nov 19, 2024 by axinging • Draft

[WebNN] Support negative steps for slice ep:WebNN

WebNN execution provider

#22871 opened Nov 18, 2024 by shiyi9801

Loading…

Build DML in Windows GPU CI pipeline

#22869 opened Nov 18, 2024 by mszhanyi

Loading…

Refactor emulator start and stop functions for clarity and efficiency platform:mobile

issues related to ONNX Runtime mobile; typically submitted using template

#22861 opened Nov 16, 2024 by jchen351

Loading…

Keep the model metadata on the generated EP context model (use bridge api)

#22860 opened Nov 15, 2024 by chilo-ms

Loading…

[TensorRT EP] Fix wrong input order when generating IndexedSubGraph

#22857 opened Nov 15, 2024 by chilo-ms

Loading…

Enable QNN HTP spill fill buffer setting to save RAM usage. ep:QNN

issues related to QNN exeution provider

#22853 opened Nov 15, 2024 by HectorSVC

Loading…

Int4 support

#22850 opened Nov 15, 2024 by BoarQing • Draft

Previous 1 2 3 4 5 … 20 21 Next

Previous Next

ProTip! Exclude everything labeled bug with -label:bug.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly