You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
---原始邮件---
发件人: ***@***.***>
发送时间: 2025年1月2日(周四) 下午4:36
收件人: ***@***.***>;
抄送: "Han ***@***.******@***.***>;
主题: Re: [secretflow/scql] 关于SCQL-p2p模式高可用解决方案的疑问 (Issue #420)
1.SCQL目前支持任务重试,但如果OOM等原因导致服务挂掉,不会自动拉起(建议框架层做些保活、监测告警的工作)
2. run in kuscia有监控机制、实例保活、SCQL多实例并行、资源隔离等一些机制提升鲁棒性,但具体能否满足业务高可用的需求,建议提供相应的应用场景和kuscia同学对齐。
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you authored the thread.Message ID: ***@***.***>
1.想问下针对数据量大或耗时会导致服务挂掉的情况,SCQL是否有现有的高可用解决方案?
2.SCQL run in kuscia是否有对应高可用的解决方案?
The text was updated successfully, but these errors were encountered: