Pull requests: PaddlePaddle/FastDeploy

Pull requests list

[Metrics] move prompt_tokens_total report to main process
#7983 opened Jun 2, 2026 by liyonghua0910 Collaborator
5 tasks
[Metrics] move prompt_tokens_total report to main process
#7982 opened Jun 2, 2026 by liyonghua0910 Collaborator
5 tasks
[XPU] limit prefill fetch num
#7981 opened Jun 2, 2026 by cmcamdy Collaborator
5 tasks
upgrade triton moe config.
#7980 opened Jun 2, 2026 by xuanyuanminzheng Collaborator
5 tasks
【cherry-pick】upgrade triton moe config.
#7979 opened Jun 2, 2026 by xuanyuanminzheng Collaborator
5 tasks
debug allreduce fusion acc issue
#7976 opened Jun 2, 2026 by BingooYang Contributor
5 tasks
[Metax][CI]: skip trap asm on MetaX GPU to fix compile error
#7975 opened Jun 2, 2026 by zhang-chenyi Contributor
5 tasks
[Cherry-Pick][RL][Feature] Add GDR streaming weight update path (#7951)
#7971 opened Jun 2, 2026 by jackyYang6 Contributor
3 of 5 tasks
[SOT] Support flashinfer_allreduce
#7970 opened Jun 2, 2026 by ZhangX-21 Contributor
5 tasks
Revert blockwise CUDAGraph and support piecewise CUDAGraph in prefill
#7969 opened Jun 2, 2026 by ZhangX-21 Contributor
5 tasks
[OP] Remove unused parameters in produce_kv_blockwise
#7968 opened Jun 2, 2026 by zhoutianzi666 Collaborator
[XPU] add as timeout
#7967 opened Jun 2, 2026 by cmcamdy Collaborator
5 tasks
[BugFix] Add a daemon thread to decode instances to monitor preallocated blocks for timeout
#7965 opened Jun 2, 2026 by CyanScholar Contributor
4 tasks done
[Models] add fleet model fallback 2
#7964 opened Jun 2, 2026 by xiaoguoguo626807
5 tasks done
delete useless code
#7959 opened May 29, 2026 by zhoutianzi666 Collaborator
5 tasks
[Optimization] add warmup for _sample_from_probs
#7956 opened May 29, 2026 by ckl117 Collaborator
5 tasks
Fix score calculation and support neox rope for fleet-gqa-latent
#7952 opened May 28, 2026 by chang-wenbin Collaborator
5 tasks
[RL][Feature] Add GDR streaming weight update path
#7951 opened May 28, 2026 by jackyYang6 Contributor
3 of 5 tasks
[Feature] Support new blackwell decode attention
#7949 opened May 28, 2026 by freeliuzc Collaborator
5 tasks
[BugFix] Fix potential Python type error in exception tests. (label: contributor, External developers)
#7945 opened May 27, 2026 by flsgavin
5 tasks
[Scheduler] Simplify scheduler for prefill instances
#7944 opened May 27, 2026 by liyonghua0910 Collaborator
4 of 5 tasks
[Feature] Support MegaMoE
#7943 opened May 27, 2026 by Wanglongzhi2001 Collaborator
5 tasks
[Feature] Add output fallback support for OpenAI serving
#7942 opened May 27, 2026 by luukunn Collaborator
3 of 5 tasks
[Cherry-Pick][XPU] Enable CudaGraph capture for MTP draft model (#7864) (label: contributor, External developers)
#7941 opened May 27, 2026 by Clarity256
4 of 5 tasks