Registering Triton Models on the Kuberay Worker

This skill registers a Ray Serve deployment and app so that a Triton Inference Server model can be used from kuberay_worker.

Workflow

Step 1: List the models available to add

Check the Triton model list under the following paths:

submodules/Toonkit-Triton-heavy/models/
submodules/Toonkit-Triton-light/models/

Then check which models are already registered in src/kuberay_worker/models/.

List only the models that exist in Triton but not in kuberay_worker, and ask the user which one to add.
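
The comparison in Step 1 can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: the function name `list_unregistered_models` is hypothetical, and it assumes that each Triton model is a subdirectory named after the model and that each registered worker model is a `.py` file whose name is the Triton name with underscores removed and lowercased.

```python
from pathlib import Path

# Directory locations come from the text above.
TRITON_MODEL_DIRS = (
    "submodules/Toonkit-Triton-heavy/models",
    "submodules/Toonkit-Triton-light/models",
)
WORKER_MODEL_DIR = "src/kuberay_worker/models"


def list_unregistered_models(repo_root: Path) -> list[str]:
    """Return Triton model names that have no matching kuberay_worker module."""
    triton_models: set[str] = set()
    for rel in TRITON_MODEL_DIRS:
        base = repo_root / rel
        if base.is_dir():
            # Assumption: one subdirectory per Triton model, named after the model.
            triton_models.update(p.name for p in base.iterdir() if p.is_dir())

    registered: set[str] = set()
    worker_dir = repo_root / WORKER_MODEL_DIR
    if worker_dir.is_dir():
        # Assumption: one .py module per registered model.
        registered = {p.stem for p in worker_dir.glob("*.py") if p.stem != "__init__"}

    # Worker file names drop underscores and are lowercase, so normalize before comparing.
    return sorted(
        name for name in triton_models
        if name.replace("_", "").lower() not in registered
    )
```

Calling `list_unregistered_models(Path("."))` from the repository root would return the candidate names to show the user.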

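Step 2: Choose an existing model to reference

List the existing model files in src/kuberay_worker/schemas/ and src/kuberay_worker/models/, and ask the user the following:

Which existing model's pattern should be followed? (e.g. zimage, inpaint, sam2)
Are there any additional files or documents to consult?

Step 3: Confirm additional considerations

Ask the user the following: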

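Reference: default naming rules

| Item | Rule | Example |
| --- | --- | --- |
| File name | Remove underscores, lowercase | `flux_klein_i2i` → `fluxkleini2i.py` |
| Schema class name | PascalCase + Schema | `FluxKleinI2iSchema` |
| Deployment class name | PascalCase + Deployment | `FluxKleinI2iDeployment` |
| App function name | File name + `_app` | `fluxkleini2i_app` |
| MODEL_NAME | Same as `name` in the Triton config | `"flux_klein_i2i"` |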
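
The naming rules can be expressed as a small helper. This is only a sketch under assumptions: `ModelNames` and `derive_names` are hypothetical names introduced for illustration, and PascalCase is taken to mean capitalizing each underscore-separated part, which matches the `flux_klein_i2i` example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelNames:
    model_name: str        # MODEL_NAME, identical to the Triton config name
    file_name: str         # module file under kuberay_worker
    schema_class: str
    deployment_class: str
    app_function: str


def derive_names(triton_name: str) -> ModelNames:
    """Derive the conventional identifiers from a Triton model name."""
    parts = [p for p in triton_name.split("_") if p]
    stem = "".join(parts).lower()                      # underscores removed, lowercase
    pascal = "".join(p.capitalize() for p in parts)    # e.g. "i2i" becomes "I2i"
    return ModelNames(
        model_name=triton_name,
        file_name=f"{stem}.py",
        schema_class=f"{pascal}Schema",
        deployment_class=f"{pascal}Deployment",
        app_function=f"{stem}_app",
    )
```

For example, `derive_names("flux_klein_i2i")` yields the file name `fluxkleini2i.py` and the class names listed in the table.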

Reference: Triton data type mapping

| Triton type | NumPy conversion |
| --- | --- |
| TYPE_STRING | `np.array([value.encode("utf-8")])` |
| TYPE_INT32 | `np.array([value], dtype=np.int32)` |
| TYPE_INT64 | `np.array([value], dtype=np.int64)` |
| TYPE_FP32 | `np.array([value], dtype=np.float32)` |
| TYPE_UINT8 (image) | `np.asarray(image, dtype=np.uint8)` |
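
The type mapping could be wrapped in a single conversion helper, as in the sketch below. The function name `to_triton_array` is hypothetical and not part of any library. The conversions are copied from the mapping as written, and rejecting unknown type strings with an error is an added assumption.

```python
import numpy as np


def to_triton_array(triton_type: str, value):
    """Convert a Python value to the NumPy array expected for a Triton input type."""
    if triton_type == "TYPE_STRING":
        # Strings are sent as UTF-8 encoded bytes.
        return np.array([value.encode("utf-8")])
    if triton_type == "TYPE_INT32":
        return np.array([value], dtype=np.int32)
    if triton_type == "TYPE_INT64":
        return np.array([value], dtype=np.int64)
    if triton_type == "TYPE_FP32":
        return np.array([value], dtype=np.float32)
    if triton_type == "TYPE_UINT8":
        # Images (nested lists, arrays, or PIL images) keep their own shape.
        return np.asarray(value, dtype=np.uint8)
    # Assumption: anything outside the mapping is treated as an error.
    raise ValueError(f"Unsupported Triton type: {triton_type}")
```

Scalar inputs become one-element arrays, while image inputs keep their original shape, for example height × width × channels.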

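Add Triton To Rayworker

Registering Triton Models on the Kuberay Worker

Workflow

Step 1: List the models available to add

Step 2: Choose an existing model to reference

Step 3: Confirm additional considerations

Step 4: Draw up the final work plan

Step 5: Implement

Step 6: Document

Reference: default naming rules

Reference: Triton data type mapping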
