
Develop #160

Merged
solderzzc merged 50 commits into master from develop
Mar 18, 2026
Conversation

@solderzzc
Member

No description provided.

solderzzc and others added 30 commits March 15, 2026 14:27
- Move skill: skills/annotation/ → skills/segmentation/
- Add deploy scripts (deploy.sh, deploy.bat)
- Update README: new Segmentation category row, mark as ✅ Ready
- Update skills.json with sam2-segmentation entry
…ls.json

- New skill: skills/annotation/dataset-management/ with deploy scripts
- Update sam2-segmentation SKILL.md
- Update skills.json with new entries
feat: TensorRT FP16 backend for depth estimation
…, and benchmark

- deploy.bat: Windows bootstrapper with Python discovery, venv creation,
  env_config hardware detection, CUDA/CPU pip install, and TensorRT pre-build
- requirements_cpu.txt: CPU-only PyTorch + depth-anything-v2 dependencies
- requirements_cuda.txt: CUDA 12.4 PyTorch + TensorRT dependencies
- benchmark.py: cross-platform benchmark supporting CoreML and PyTorch/TRT
- deploy.sh: updated existing shell script
…enchmark scripts

- transform.py: add ONNX model download and inference path
- deploy.bat: refine Python discovery and dependency flow
- requirements: update dependency specs
- benchmark.py: unify inference routing for CoreML/ONNX/PyTorch
…ploy script

- transform.py: refine available EP detection for ONNX inference
- deploy.bat: update dependency installation flow
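The "available EP detection" step above can be sketched as a small provider-priority helper. This is an illustrative sketch, not the actual transform.py code: the helper name `pick_providers` and the CUDA > CoreML > CPU priority order are assumptions.

```python
def pick_providers(available):
    """Order ONNX Runtime execution providers by preference.

    `available` is the list returned at runtime by
    onnxruntime.get_available_providers(); the priority order below
    (CUDA, then CoreML, then CPU) is an assumption for illustration.
    """
    preferred = ["CUDAExecutionProvider", "CoreMLExecutionProvider"]
    chosen = [p for p in preferred if p in available]
    chosen.append("CPUExecutionProvider")  # CPU is always the final fallback
    return chosen
```

A session would then be created with something like `onnxruntime.InferenceSession(model_path, providers=pick_providers(onnxruntime.get_available_providers()))`, so the same code path runs on CUDA, Apple Silicon, or plain CPU hosts.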
- Remove assistant prefill injection that caused 400 errors with Qwen3.5
  when enable_thinking is active
- Remove presence_penalty from JSON-expected requests
- Fix VLM/LLM split to only count image analysis suites as VLM
…ntent

- Append JSON-only instruction to last user message for local models
- Replace null content with empty string for llama-server compatibility
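A minimal sketch of that message sanitization, assuming OpenAI-style chat messages; the function name and the exact wording of the appended instruction are hypothetical, and the real logic lives in the benchmark runner:

```python
def sanitize_messages(messages):
    """Prepare chat messages for llama-server.

    - Replace null/None content with "" (llama-server rejects null content).
    - Append a JSON-only instruction to the last user message.
    The instruction wording below is illustrative, not the shipped prompt.
    """
    out = [dict(m, content=m.get("content") or "") for m in messages]
    for msg in reversed(out):
        if msg.get("role") == "user":
            msg["content"] += "\nRespond with valid JSON only."
            break
    return out
```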
- transform.py: improve ONNX runtime execution provider handling
- requirements_cuda.txt: add onnxruntime-gpu dependency
- Auto-detect mode: default to 'llm' when no VLM URL is set
- Convert tool_calls/tool messages to plain text for llama-server compat
- Smart max_tokens: use max_completion_tokens for cloud, max_tokens for local
- Expand stripThink to handle Qwen3.5 plain-text reasoning blocks
- Harden JSON parser: clean ellipsis, placeholder tags, trailing commas
- Always exit 0 in skill mode (results reported via JSON events)
- Add stream_options: { include_usage: true } for OpenAI API token reporting
- Fall back to chunk-counted completion tokens for local llama-server
- Track per-test tokens via _currentTestTokens accumulator
- Include token data in test results, log output, and emitted events
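The usage-block-with-chunk-fallback counting above can be sketched as follows. The chunk dictionaries follow the OpenAI streaming response shape; the helper name is made up, and "one token per content-bearing chunk" is the stated approximation for local llama-server streams:

```python
def completion_tokens(chunks):
    """Count completion tokens from a streamed chat response.

    Prefer the final usage block emitted when stream_options
    include_usage is set; otherwise fall back to counting
    content-bearing chunks (roughly one token per chunk for
    llama-server streams -- an approximation, not an exact count).
    """
    counted = 0
    for chunk in chunks:
        usage = chunk.get("usage")
        if usage and usage.get("completion_tokens") is not None:
            return usage["completion_tokens"]  # authoritative count
        choices = chunk.get("choices") or []
        if choices and (choices[0].get("delta") or {}).get("content"):
            counted += 1
    return counted
```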
- Auto-detect LLM-only mode when no VLM URL is provided
- Use max_completion_tokens for cloud APIs (GPT-5.4+), max_tokens for local
- Always exit 0 in Aegis skill mode regardless of test failures
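The cloud/local token-limit split reduces to a one-line parameter builder. The two field names come from the notes above; the helper name and the 2000 default are illustrative:

```python
def token_limit(is_cloud, limit=2000):
    """Build the token-cap request parameter.

    Cloud chat APIs expect max_completion_tokens; local llama-server
    still takes max_tokens. The 2000 default mirrors the streaming
    safety-net cap mentioned above.
    """
    return {"max_completion_tokens" if is_cloud else "max_tokens": limit}
```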
- Remove stream_options for local llama-server (causes crashes)
- Drop max_tokens — streaming 2000-token cap is safety net
- Enhance parseJSON for multi-word <placeholder> tags
- Add JSON extraction fallback from reasoning_content
- Simplify prompt template to avoid template echoing
- Fix process.exit(1) in skill mode for clean status
fix(benchmark): disable thinking mode & improve JSON parsing
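The parser hardening described in these commits lives in JavaScript (the benchmark runner); the Python sketch below mirrors the same cleanup steps (strip reasoning blocks, extract the first `{...}` span, drop placeholder tags, ellipses, and trailing commas) and is not the actual implementation:

```python
import json
import re

def parse_model_json(text):
    """Best-effort JSON extraction from a model response.

    Mirrors the hardening steps above: strip <think> reasoning blocks,
    take the first {...} span, remove <placeholder> tags and ellipsis
    debris, and drop trailing commas. Removing "..." could in principle
    clip a legitimate string value -- acceptable for a lenient parser.
    """
    text = re.sub(r"<think>.*?</think>", "", text, flags=re.S)
    m = re.search(r"\{.*\}", text, flags=re.S)
    if not m:
        return None
    s = m.group(0)
    s = re.sub(r"<[a-zA-Z][\w -]*>", "", s)        # multi-word <placeholder> tags
    s = s.replace("\u2026", "").replace("...", "") # ellipsis debris
    s = re.sub(r",\s*([}\]])", r"\1", s)           # trailing commas
    try:
        return json.loads(s)
    except json.JSONDecodeError:
        return None
```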
…ll auto-start

- Update benchmark paper: 131→143 tests (VLM Scene 35→47, 3 new dedup scenarios, 4 new tool-use scenarios)
- Add performance metrics to run-benchmark.cjs (TTFT, decode throughput tracking)
- Fix tool_call argument serialization for non-string arguments
- Enable auto_start for yolo-detection-2026 and depth-estimation skills
- Add LaTeX build artifacts .gitignore
feat: expand HomeSec-Bench to 143 tests, add perf metrics, enable ski…
- Renamed 'YOLO 2026 Object Detection' → 'YOLO 2026'
- Renamed 'Depth Estimation (Privacy)' → 'Depth Anything V2'
- Added disabled: true to Model Training, SAM2 Segmentation, Annotation Data
feat: rename skills for sidebar clarity and disable unstable skills
- Ship pre-built yolo26n.onnx (9.5MB) and yolo26n_names.json
- Add _OnnxCoreMLModel wrapper using onnxruntime + CoreMLExecutionProvider
- Bypasses macOS 26.x MPSGraph MLIR crash (SIGABRT in MPSGraphExecutable.mm)
- Inference: 11ms/frame (~91 FPS) on Apple M5 Pro
- Strip requirements_mps.txt: remove torch/torchvision/ultralytics (~120 MB → ~17 MB)
- Class names loaded from JSON instead of .pt (no torch dependency at runtime)
Add pre-exported ONNX models for all four detection sizes:
- yolo26n.onnx (9.5 MB) — nano (already shipped)
- yolo26s.onnx (37 MB) — small
- yolo26m.onnx (78 MB) — medium
- yolo26l.onnx (95 MB) — large

Each includes a companion _names.json with COCO 80 class labels.
Eliminates torch/ultralytics dependency for all model sizes.
- Revert shipping s/m/l ONNX models in repo (~210MB saved)
- Keep only yolo26n.onnx (9.5MB) shipped for zero-config default
- Add _download_onnx_from_hf() for s/m/l: downloads from
  onnx-community/yolo26{s,m,l}-ONNX on first use
- Uses stdlib urllib (no extra dependencies)
- Auto-copies class names from shipped yolo26n_names.json
solderzzc and others added 20 commits March 18, 2026 11:49
- Replace ultralytics-exported yolo26n.onnx with onnx-community version
- Update _OnnxCoreMLModel to parse HF format: logits [1,300,80] + pred_boxes [1,300,4]
- All YOLO26 sizes (n/s/m/l) now use the same onnx-community format
- Verified: 15-25ms/frame on M5 Pro with CoreML EP, correct detections
refactor: standardize on onnx-community HuggingFace ONNX format
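Decoding the onnx-community output format described above (logits [1,300,80] + pred_boxes [1,300,4]) might look like the sketch below, after squeezing the batch dimension. The DETR-style sigmoid scoring and normalized cxcywh box layout are assumptions to verify against the model card; the function name is made up:

```python
import math

def decode_hf_detections(logits, pred_boxes, class_names, conf=0.5):
    """Decode HF-format detector outputs into labeled detections.

    logits: [N, num_classes] raw class logits (sigmoid-scored per
    DETR convention -- an assumption). pred_boxes: [N, 4], assumed
    normalized center-x/center-y/width/height. Queries whose best
    class score falls below `conf` are dropped.
    """
    detections = []
    for row, box in zip(logits, pred_boxes):
        scores = [1.0 / (1.0 + math.exp(-x)) for x in row]
        best = max(range(len(scores)), key=scores.__getitem__)
        if scores[best] >= conf:
            detections.append(
                {"label": class_names[best], "score": scores[best], "box": box}
            )
    return detections
```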
- Redesign generate-report.cjs as a multi-view Operations Center
  - Three tabs: Performance, Quality, Vision
  - Run picker sidebar with model-grouped history + multi-select
  - Comparison tables across selected runs
  - Export to Markdown for community sharing
- Add live progress mode (auto-refresh + LIVE banner)
  - Intermediate saves after each suite completes
  - Browser auto-opens with pulsing progress indicator
  - Auto-refreshes every 5s during benchmark run
- Save VLM fixture metadata (filename, response, prompt) per test
- Embed all data inline for fully self-contained HTML
feat: benchmark Operations Center with live progress dashboard
- saveLiveProgress() called after each test, not just each suite
- Include in-progress suite in live data for Quality/Vision tabs
- Skip fixture image embedding in live mode (~43MB savings per regeneration)
- Enhanced live banner with test name and test count
- Use HTML entities (&#39;) for quotes in onclick to avoid multi-level escaping
- Replace <meta http-equiv=refresh> with JS setTimeout for stateful reload
- Preserve active tab + scroll position across refreshes via sessionStorage
- Compute live perfSummary from accumulated TTFT/decode arrays
- TTFT, Decode Speed, Server Prefill/Decode now update in real-time
- Fix SyntaxError: use HTML entities for collapsed toggle onclick
- Replace meta refresh with JS setTimeout + sessionStorage state
- sampleResourceMetrics() parses ioreg for Apple Silicon MPS stats
- GPU utilization, renderer %, GPU memory, system memory tracked
- Sampled after each suite, included in live perfSummary
- 3 new hero cards: GPU Utilization, GPU Memory, System Memory
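sampleResourceMetrics() itself is JavaScript in the report generator; as a Python sketch, GPU utilization on Apple Silicon can be scraped from ioreg's accelerator statistics. The `"Device Utilization %"` key name is an assumption to verify against real `ioreg -l` output on your machine:

```python
import re
import subprocess

def gpu_utilization(ioreg_text=None):
    """Parse GPU utilization (percent) from `ioreg -l` output.

    The "Device Utilization %" key is an assumption about the
    AGXAccelerator PerformanceStatistics dictionary; returns None
    when the key is absent. Pass `ioreg_text` to parse captured
    output instead of shelling out.
    """
    if ioreg_text is None:
        ioreg_text = subprocess.run(
            ["ioreg", "-l"], capture_output=True, text=True
        ).stdout
    m = re.search(r'"Device Utilization %"\s*=\s*(\d+)', ioreg_text)
    return int(m.group(1)) if m else None
```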
solderzzc merged commit d51176f into master on Mar 18, 2026
1 check passed
2 participants