CTO · Chief Technology Officer
Agent evaluation engine · trace intelligence · data flywheel
You will own
- System architecture and engineering implementation of the evaluation engine: trace ingestion, analysis pipelines, the LLM-as-judge framework, and regression test set management.
- The design and rollout of the SDK onboarding experience, so that builders can finish integration within 30 minutes while their data never leaves their local environment.
- A share of product definition: breaking the abstract idea of "evaluation" into concrete modules that can be engineered and delivered.
You probably are
- Someone who has built the full engineering pipeline of an LLM application or Agent system and knows how traces, context and tool calls flow in production.
- Someone with hands-on engineering experience in observability, evaluation or data pipelines. Integration work with LangSmith, Datadog or OpenTelemetry is a plus.
- AI native: able to propose technical solutions independently and build them yourself. You care more about "work that matters" than about a title, and are willing to accept early-stage uncertainty in exchange for more room to grow.