# Indra-Swarm API

## Gemma-4-26B-A4B-it

[Model info](https://huggingface.co/google/gemma-4-26B-A4B-it)

**Docker image's current SGLang config**

```
python3 -m sglang.launch_server \
  --model-path google/gemma-4-26b-a4b-it \
  --tp 2 \
  --port 3000 \
  --host 0.0.0.0 \
  --attention-backend triton \
  --mem-fraction-static 0.8 \
  --max-running-requests 128 \
  --chunked-prefill-size 4096 \
  --context-length 32768 \
  --trust-remote-code \
  --enable-piecewise-cuda-graph \
  --schedule-policy lpm
```
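
The server may not accept requests immediately after launch while the model loads. As a minimal sketch, the helper below polls the server until it responds. It assumes the server exposes the OpenAI-compatible `/v1/models` route; `wait_for_server` and its arguments are hypothetical names introduced here for illustration.

```shell
# Hypothetical helper: poll an OpenAI-compatible /v1/models route until it answers.
# Arguments: base URL, max attempts (default 30), delay in seconds (default 2).
# Assumption: the server exposes /v1/models once it is ready.
wait_for_server() {
  base_url="$1"
  attempts="${2:-30}"
  delay="${3:-2}"
  i=0
  while [ "$i" -lt "$attempts" ]; do
    # -f makes curl return non-zero on HTTP errors; --max-time bounds each probe.
    if curl -sf --max-time 5 "${base_url}/v1/models" >/dev/null 2>&1; then
      return 0
    fi
    i=$((i + 1))
    sleep "$delay"
  done
  return 1
}
```

For example, `wait_for_server http://192.168.40.40:3000 && echo "ready"` would print `ready` only once the route answers, and give up after the configured number of attempts.
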

**Test Curl**

```
curl http://192.168.40.40:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemma-4-26b-a4b-it",
    "messages": [{"role": "user", "content": "System check. Are you online?"}]
  }'
```

## faster-Qwen3-tts

[Model info](https://github.com/andimarafioti/faster-qwen3-tts)

**Test Curl**

```
curl -X POST http://192.168.40.40:8002/v1/audio/speech \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tts-1",
    "input": "This is a text-to-speech system check. Audio synthesis is functional on Indra.",
    "voice": "nona",
    "response_format": "wav",
    "seed": 42
  }' \
  --output tts_test.wav
```

To change voices, set "voice" to any of the following:

- aus-female-1
- aus-female-2
- aus-female-3
- aus-female-4
- aus-female-5
- aus-female-6
- aus-male-1
- aus-male-2
- aus-male-3
- aus-male-4
- aus-male-5
- aus-male-6
- aus-male-7
- charter
- gaius
- _gantry
- nona
- oni
- vulcan

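
To compare voices side by side, the request above can be wrapped in a small loop. The following is a minimal sketch, not part of the service itself: `tts_payload` and `tts_all_voices` are hypothetical helper names, and the payload builder assumes the input text contains no double quotes or backslashes.

```shell
# Hypothetical helper: build the JSON body for /v1/audio/speech.
# Arguments: voice name, input text.
# Assumption: the text contains no double quotes or backslashes (no JSON escaping is done).
tts_payload() {
  printf '{"model": "tts-1", "input": "%s", "voice": "%s", "response_format": "wav", "seed": 42}' "$2" "$1"
}

# Hypothetical helper: synthesize one sample per voice into a directory.
# Arguments: output directory, then one or more voice names.
tts_all_voices() {
  outdir="$1"
  shift
  mkdir -p "$outdir"
  for voice in "$@"; do
    curl -sS -X POST http://192.168.40.40:8002/v1/audio/speech \
      -H "Content-Type: application/json" \
      -d "$(tts_payload "$voice" "Voice check for ${voice}.")" \
      --output "${outdir}/${voice}.wav"
  done
}
```

For example, `tts_all_voices ./voice-check nona oni vulcan` would write one `.wav` file per listed voice into `./voice-check`.
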
## faster-whisper-large-v3-turbo-ct2

[Model info](https://huggingface.co/deepdml/faster-whisper-large-v3-turbo-ct2)

**Test Curl**

This test uses a known audio file saved locally on the Indra machine.

```
curl http://192.168.40.40:8005/v1/audio/transcriptions \
  -H "Content-Type: multipart/form-data" \
  -F "file=@/mnt/nvme3n1/swarm/voice-samples/aus-male-1.wav" \
  -F "model=deepdml/faster-whisper-large-v3-turbo-ct2" \
  -F "response_format=json"
```
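
The test above depends on a file that exists only on the Indra machine. To transcribe a different local file, the same request can be wrapped as sketched below. `transcribe` is a hypothetical helper name; it checks that the file exists before contacting the server, and lets curl set the multipart header itself via `-F`.

```shell
# Hypothetical helper: transcribe an arbitrary local audio file.
# Argument: path to the audio file.
# Returns 1 without contacting the server if the file does not exist.
transcribe() {
  audio_file="$1"
  if [ ! -f "$audio_file" ]; then
    echo "transcribe: no such file: $audio_file" >&2
    return 1
  fi
  curl -sS http://192.168.40.40:8005/v1/audio/transcriptions \
    -F "file=@${audio_file}" \
    -F "model=deepdml/faster-whisper-large-v3-turbo-ct2" \
    -F "response_format=json"
}
```

For example, `transcribe tts_test.wav` would send the file produced by the text-to-speech test back through the transcription endpoint.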
