mirror of
https://github.com/ruvnet/RuView
synced 2026-06-18 11:43:19 +00:00
17471e93ff
* feat(calibration): NodeGeometry transceiver-geometry recording (ADR-152 §2.1.1) PerceptAlign-motivated geometry capture at enrollment: per-node optional records (position, antenna orientation, inter-node distances, acquisition method) — recorded when known, never required. Event-sourced via EnrollmentEvent::GeometryRecorded (latest recording wins); persisted on SpecialistBank with serde defaults so pre-ADR-152 bank JSON loads cleanly (fixture-proven, and geometry-free banks serialize byte-shape-identical to the old schema); threaded through MultiNodeMixture as data only — the learned geometry embeddings and algorithmic fusion use are §2.1.2, deliberately deferred until the ADR-151 P6 LoRA heads exist. Geometry recorded from now on means banks captured today remain usable for layout-conditioned training later — you can't retroactively add geometry to data you didn't record. 8 new tests (3 geometry, 2 anchor, 2 bank, 1 multistatic) + full-loop extension (2-node geometry, one tape-measured + one unknown, surviving the bank JSON round-trip the runtime loads from). 50/50 calibration (both feature configs) + 23 CLI tests green. Co-Authored-By: RuFlo <ruv@ruv.net> * feat(training): two-checkerboard camera↔room calibration for ADR-079 labels (ADR-152 §2.1.3) Defends the camera-supervised pipeline against PerceptAlign's "coordinate overfitting": MediaPipe keypoints were emitted in raw camera coordinates with no shared frame and no transceiver-geometry metadata — the exact label shape that memorizes deployment layout and collapses cross-layout. - scripts/calibrate-camera-room.py + calibration_lib.py: OpenCV two-checkerboard calibration → versioned bundle JSON (intrinsics, camera→room extrinsics, checkerboard spec, transceiver geometry, sha256 calibration_id). Intrinsics resolve from file > cache > multi-view computation > loud-warning 2-view fallback. - collect-ground-truth.py --calibration <bundle>: every sample gains keypoints_room (unit bearing rays from the camera center in the room frame — documented projective alignment; raw image coords preserved so training chooses), camera_origin_room, calibration_id, and the transceiver geometry stamp. Without the flag, output is byte-identical to before (tested) + a one-line ADR-152 warning. Design finding (recorded for ADR-152): a single planar checkerboard's corner grid is centrosymmetric — the reversed corner ordering fits a ghost camera pose with IDENTICAL reprojection error, so per-board flip disambiguation is mathematically ill-posed. solve_two_board_extrinsics solves the joint wall+floor set over all 4 flip combinations, where the minimum is unique — an independent reason the TWO-checkerboard method is required, beyond what PerceptAlign states. 15 headless pytest tests green (synthetic corners: extrinsics recovery incl. ghost resolution, bundle round-trip + hash stability, ray transforms w/ distortion + cross-resolution, no-calibration byte identity). Co-Authored-By: RuFlo <ruv@ruv.net> * feat(benchmarks): WiFlow-STD reproduction harness + measurement (a) results (ADR-152 §2.2) Shipped checkpoint REFUTED (0.08% PCK@20, wrong keypoint normalization); 6 reproducibility defects documented (broken imports, corrupted dataset tail with float32-max garbage that NaN-poisons fp16 BatchNorm, unreachable test phase). After repairs, retraining with upstream defaults reproduces 96.09% PCK@20 full-test / 96.61% corruption-free (published 97.25%) on RTX 5080. Claims graded MEASURED-EQUIVALENT; 2.23M params + ~0.055 GFLOPs verified. Third-party code/weights/data stay out of tree (gitignored). Co-Authored-By: claude-flow <ruv@ruv.net> * feat: ADR-152 Rust integrations + ADR-153 802.11bf protocol model - calibration: GeometryEmbedding — 32-slot permutation-invariant NodeGeometry featurization for future LoRA-head conditioning (ADR-152 §2.1.2); derived SpecialistBank::geometry_embedding() accessor; 59 tests - train: MaePretrainConfig + patchify/random-mask with UNSW measured recipe (80% masking, (30,3) patches; ADR-152 §2.3, arXiv 2511.18792); strict no-truncate/no-NaN policy; proptest properties - train: WiFlowStdModel — tch-gated port of the verified ~96%-PCK@20 WiFlow-STD architecture (ADR-152 §2.2 beyond-SOTA); ungated param formula pinned to 2,225,042; 15/17-keypoint support; 239 crate tests - hardware: ieee80211bf forward-compatibility protocol model (ADR-153): SpecProfile gates, SensingCapabilities negotiation, required ConsentMode, session FSM, SensingTransport + SimTransport + OpportunisticCsiBridge; full acceptance checklist covered; 156+4 tests - deps: ruvector bumps per ADR-152 §2.6 survey (mincut/solver 2.0.6, attention 2.1.0, gnn 2.2.0); vendor/ruvector synced to a083bd77f - docs: ADR-153 accepted; ADR-152 §2.2 status, §2.4 amendment, §2.6 added Workspace: 162 test suites green (--no-default-features); Python proof PASS. Known pre-existing flake: homecore-api env_empty_falls_back_to_defaults (unserialized env-var mutation) — untouched, follow-up. Co-Authored-By: claude-flow <ruv@ruv.net> * docs: CHANGELOG + CLAUDE.md entries for ADR-152 integrations and ADR-153 Co-Authored-By: claude-flow <ruv@ruv.net> * fix(train): repair tch-backend bit-rot — gated path compiles and tests run again Mechanical API refresh against current tch: Vec::from(Tensor) -> try_from (+ explicit flatten), numel() usize cast, Rem/div ops -> remainder() / divide_scalar_mode(floor) — the latter fixed a silent true-division bug in heatmap argmax decoding; clamp(1.0, f64::MAX) -> clamp_min (torch 2.x scalar overflow panic); petgraph EdgeRef import; missing EvalMetrics and verify_checkpoint_dir APIs that tests documented. wiflow_std roundtrip test uses safetensors (.pt _save_parameters roundtrip broken in torch 2.11 Windows). Gated: 349 passed (incl. all 20 wiflow_std); ungated: unchanged. Known pre-existing: gaussian-heatmap convention mismatch (2 tests), proof seed race under parallel threads — documented, deliberate follow-ups. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(train): WiFlow-STD PyTorch->tch weight import + numerical parity proof export_to_safetensors.py maps the retrained checkpoint (295 tensors -> 248 mapped, param sum exactly 2,225,042; num_batches_tracked dropped) into a tch-loadable safetensors plus a deterministic parity fixture. Gated #[ignore] integration test loads it strictly and asserts forward-pass agreement: max abs diff 1.192e-7 on the seed-42 fixture. dump_variable_names test makes the tch name layout authoritative. Zero architecture discrepancies found. Co-Authored-By: claude-flow <ruv@ruv.net> * fix: workflow-review findings — BN gamma init, ThresholdParams serde, init docs Concurrent validation workflow (2 review lanes + adversarial verification, 13 agents): 5 confirmed findings, 3 refuted. Fixes: - wiflow_std: pin BatchNorm gamma to 1.0 (tch default draws Uniform(0,1) — silently halves activations in from-scratch training; loaded checkpoints unaffected, parity re-verified after the change) - wiflow_std: document the conv-init divergences vs the reference's effective kaiming_normal(fan_out) re-init (from-scratch dynamics only) - ieee80211bf: ThresholdParams deserialization validates via try_from so the <=100 invariant holds for untrusted payloads (+ rejection test) Benchmarks (release, ruvzen): GeometryEmbedding 1.84us/call (542k/s), MAE tokenization 7.38us/window (135k/s), 802.11bf FSM 8.9M events/s — nothing suspicious. Co-Authored-By: claude-flow <ruv@ruv.net> * docs(adr): ADR-152 §2.1.4 gate resolved — PerceptAlign repo MIT, dataset on HF Co-Authored-By: claude-flow <ruv@ruv.net> * feat(benchmarks): edge optimization measured + measurement (b) blocked + 92.9% retraction Edge optimization (ADR-152 optimize track): ONNX Runtime fp32 is the CPU latency win (3.2 ms/window, ~3.4x faster than torch, parity 2.4e-7); ORT dynamic int8 reaches 2.44 MB (paper's ~2.2 MB claim plausible only via conv-capable toolchains; -0.16pt PCK@20, +18% MPJPE, 2x slower); torch dynamic quant converts 0% of this conv-only model; fp16 halves storage free but is slower on CPU. Measurement (b) BLOCKED-ON-DATA: only 1,077 paired ESP32 windows exist (stop rule <2k). Forensic recheck of the surviving April holdout RETRACTS the ADR-079 '92.9% PCK@20' figure: constant-output model, absolute (not torso) threshold, 69 near-static frames — mean predictor scores 100% under that protocol; torso-PCK@20 is 19.1%. Corroborates PR #535. Stale citations removed from user-guide, readme-details, ADR-152 §2.1.3; no-citation rule extended to ADR-079 accuracy claims. Unblock: >=2k-window multi-pose paired session + torso-PCK re-baseline. Co-Authored-By: claude-flow <ruv@ruv.net> * docs(user-guide): corrected camera-supervised collection tutorial Step 0 CSI-rate check + session-length math (window yield = frames/20 — the May session's 8x under-delivery was a ~12 Hz CSI rate, not an aligner bug); two-checkerboard calibration step (ADR-152 §2.1.3); pose-variety and confidence guidance; torso-normalized PCK + temporal-split + pred-variance eval protocol (lessons from the 92.9% retraction); scale presets re-keyed to realistic window counts. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(benchmarks): static PTQ int8 (calibrated) results + overnight capture script Conv-only static QDQ beats dynamic int8 on accuracy (PCK@20 96.61-96.63% vs 96.52%, MPJPE +10% vs +18% over fp32) at ~equal size/latency; all-ops QDQ strictly worse (int8 activations through attention glue). Entropy calibration verified bit-identical to MinMax on this data. Deployment: ONNX fp32 for speed (3.2ms), static conv-only QDQ for smallest (2.53MB). Also: scripts/overnight-empty-capture.py — segmented UDP CSI recorder for empty-room baselines (no glob collisions, detach-safe). Co-Authored-By: claude-flow <ruv@ruv.net> * feat(benchmarks): measurement (b) MEASURED — optimization transfer only, mean-pose baseline wins WiFlow-STD fine-tuned on 2,046 fresh single-room ESP32 paired windows (temporal 70/15/15, 70->540 adapter, K=17): pretrained-init 65% PCK@20 vs scratch 0% (optimization transfer) but frozen-trunk ~0% (no feature transfer), and NOTHING beats the mean-pose baseline (95.9% PCK@20 — single subject, near-static normalized coords). Honesty gates held: pred std 0.0113 (non-constant model) but mean-baseline dominance means no citable CSI->pose capability from this data. ADR-152 open question 1 answered partially; definitive answer needs multi-subject/position data. Two new aligner findings: heterogeneous csi_shape with silent zero-padding (~20%), and extractCsiMatrix's transposed shape label (frame-major data, [nSc, nFrames] label) — fixes pending. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(benchmarks): efficiency sweep MEASURED — half model dominates full reference Compact WiFlow-STD variants on the same data/split/protocol: half (843,834 params, 0.38x) strictly dominates the 2.23M reference (PCK@20 96.62 vs 96.61, PCK@50 99.47 vs 99.11, MPJPE 0.00898 vs 0.0094) — the published architecture is over-parameterized for its own benchmark. quarter (338k) 96.05%; tiny (56,290 params, 1/39.5) holds 94.11% — a ~220KB fp32 edge candidate. In-domain caveats recorded; cross-domain untested. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(train): compact WiFlow-STD presets in Rust + tiny edge artifact (ADR-152) WiFlowStdConfig gains half()/quarter()/tiny() mirroring the overnight sweep exactly: TcnGroupsMode (Fixed/Gcd/Depthwise), input_pw_groups, derived stride schedule and decoder-mid (all default to upstream behavior; legacy serde JSON unaffected). Param formulas pin to trained ground truth first try: 843,834 / 338,600 / 56,290; default 2,225,042 pin and 1.192e-7 parity unchanged. 248 tests green. Tiny edge artifact (tiny_edge_bench.py): ONNX fp32 = 295 KB, 0.66 ms/win (~1,500/s CPU), 94.11% PCK@20 (matches sweep clean-test exactly; parity 1.49e-7). Static int8 is a bad trade at this scale (-1.43pt, +19% MPJPE, -16% size, slower) — recorded as negative result. Export note: width-16 breaks AdaptiveAvgPool((15,1)) TorchScript export; replaced by exact mean+matmul equivalent, proven by parity. Co-Authored-By: claude-flow <ruv@ruv.net> * fix: resolve all 10 confirmed code-review findings (7-angle review, 20/20 verified) wiflow_std: min_feature_width (default 15) replaces the keypoints->stride coupling — for_keypoints(17) now provably builds the trained [2,2,2,2] graph and pools 15->17, matching the validated Python protocol (pinned by tests); param_count() total on invalid configs; random_mask returns Result and rejects non-finite/out-of-range ratios; trainer checkpoints switched to safetensors (.pt VarStore roundtrip broken on Windows torch 2.11). ieee80211bf: SBP proxy now re-triggers instances and relays reports via Action::RelaySbpReport -> SensingFrame::SbpReport (clients consume via their existing path); missed_instances reset on success = consecutive semantics; SessionTable gains a guarded SBP entry point + unknown-id drop counter; initiator-role sessions reject inbound setup/SBP requests (RejectedNotSupported) closing the idle hijack; StartSetup/StartSbp outside Idle return InvalidStateForCommand; SBP validation unified through evaluate_setup with a 1:1 SetupStatus->SbpStatus mapping. events.rs split out to honor the 500-line cap. calibration/cli: enrollment geometry now actually reaches trained banks — both production call sites attach .with_geometry; --geometry flag on train-room and POST /enroll/geometry + train-body geometry on calibrate-serve give production a recording surface; geometry-free banks log the ADR-152 §2.1.2 note. benchmarks: corruption masks committed as ground truth (unregenerable after in-place cleaning; verified bit-identical regeneration from the pristine copy) + generate_corruption_masks.py producer; _bench_common.py dedups the 5x-copied shim/evaluate/seed/remap (post-refactor PCK@20 re-verified equal to the last digit); remote scripts get the mmap patch; tiny_edge --calib validated multiple-of-64; onnx_bench --help no longer executes (and overwrote) the export — artifact restored byte-exact. Workspace: 2,963 tests passed, 0 failed; Python proof PASS. Co-Authored-By: claude-flow <ruv@ruv.net> * ci: build workspace tests without debuginfo — runner disk exhaustion The combined 38-crate debug target exceeds the GitHub runner's disk ('final link failed: No space left on device'); the same tree measured 151GB locally with full debuginfo. CARGO_PROFILE_{DEV,TEST}_DEBUG=0 shrinks the target ~5-10x; debuginfo serves no purpose in CI test runs. Co-Authored-By: claude-flow <ruv@ruv.net>
390 lines
13 KiB
Python
390 lines
13 KiB
Python
#!/usr/bin/env python3
|
|
"""Camera ground-truth collection for WiFi pose estimation training (ADR-079).
|
|
|
|
Captures webcam keypoints via MediaPipe PoseLandmarker (Tasks API) and
|
|
synchronizes with ESP32 CSI recording from the sensing server.
|
|
|
|
Output: JSONL file in data/ground-truth/ with per-frame 17-keypoint COCO poses.
|
|
|
|
With --calibration <bundle.json> (produced by scripts/calibrate-camera-room.py,
|
|
ADR-152 S2.1.3), every record is additionally stamped with room-frame bearing
|
|
rays for each keypoint, the calibration_id, and the transceiver geometry --
|
|
the PerceptAlign-style defense against coordinate overfitting. Raw image
|
|
coordinates are always kept; without depth the room-frame representation is
|
|
a projective alignment (rays, not 3D points) -- see scripts/calibration_lib.py.
|
|
Without --calibration the output is byte-identical to the original ADR-079
|
|
format.
|
|
|
|
Usage:
|
|
python scripts/collect-ground-truth.py --preview --duration 60
|
|
python scripts/collect-ground-truth.py --server http://192.168.1.10:3000
|
|
python scripts/collect-ground-truth.py --calibration data/calibration/camera-room.json
|
|
"""
|
|
|
|
from __future__ import annotations
|
|
|
|
import argparse
|
|
import json
|
|
import os
|
|
import signal
|
|
import sys
|
|
import time
|
|
import urllib.request
|
|
import urllib.error
|
|
from pathlib import Path
|
|
from datetime import datetime
|
|
|
|
import cv2
|
|
import numpy as np
|
|
|
|
import mediapipe as mp
|
|
from mediapipe.tasks.python import BaseOptions
|
|
from mediapipe.tasks.python.vision import (
|
|
PoseLandmarker,
|
|
PoseLandmarkerOptions,
|
|
RunningMode,
|
|
)
|
|
|
|
# ---------------------------------------------------------------------------
|
|
# MediaPipe 33 landmarks -> 17 COCO keypoints
|
|
# ---------------------------------------------------------------------------
|
|
# COCO idx : MP idx : joint name
|
|
# 0 : 0 : nose
|
|
# 1 : 2 : left_eye
|
|
# 2 : 5 : right_eye
|
|
# 3 : 7 : left_ear
|
|
# 4 : 8 : right_ear
|
|
# 5 : 11 : left_shoulder
|
|
# 6 : 12 : right_shoulder
|
|
# 7 : 13 : left_elbow
|
|
# 8 : 14 : right_elbow
|
|
# 9 : 15 : left_wrist
|
|
# 10 : 16 : right_wrist
|
|
# 11 : 23 : left_hip
|
|
# 12 : 24 : right_hip
|
|
# 13 : 25 : left_knee
|
|
# 14 : 26 : right_knee
|
|
# 15 : 27 : left_ankle
|
|
# 16 : 28 : right_ankle
|
|
|
|
MP_TO_COCO = [0, 2, 5, 7, 8, 11, 12, 13, 14, 15, 16, 23, 24, 25, 26, 27, 28]
|
|
|
|
COCO_BONES = [
|
|
(5, 7), (7, 9), (6, 8), (8, 10), # arms
|
|
(5, 6), # shoulders
|
|
(11, 13), (13, 15), (12, 14), (14, 16), # legs
|
|
(11, 12), # hips
|
|
(5, 11), (6, 12), # torso
|
|
(0, 1), (0, 2), (1, 3), (2, 4), # face
|
|
]
|
|
|
|
MODEL_URL = (
|
|
"https://storage.googleapis.com/mediapipe-models/"
|
|
"pose_landmarker/pose_landmarker_lite/float16/latest/"
|
|
"pose_landmarker_lite.task"
|
|
)
|
|
MODEL_FILENAME = "pose_landmarker_lite.task"
|
|
|
|
|
|
# ---------------------------------------------------------------------------
|
|
# Helpers
|
|
# ---------------------------------------------------------------------------
|
|
|
|
def ensure_model(cache_dir: Path) -> Path:
|
|
"""Download the PoseLandmarker model if not already cached."""
|
|
model_path = cache_dir / MODEL_FILENAME
|
|
if model_path.exists():
|
|
return model_path
|
|
|
|
cache_dir.mkdir(parents=True, exist_ok=True)
|
|
print(f"Downloading {MODEL_FILENAME} ...")
|
|
try:
|
|
urllib.request.urlretrieve(MODEL_URL, str(model_path))
|
|
print(f" saved to {model_path}")
|
|
except Exception as exc:
|
|
print(f"ERROR: Failed to download model: {exc}", file=sys.stderr)
|
|
print(
|
|
"Download manually from:\n"
|
|
f" {MODEL_URL}\n"
|
|
f"and place at {model_path}",
|
|
file=sys.stderr,
|
|
)
|
|
sys.exit(1)
|
|
return model_path
|
|
|
|
|
|
def post_json(url: str, payload: dict | None = None, timeout: float = 5.0) -> bool:
|
|
"""POST JSON to a URL. Returns True on success, False on failure."""
|
|
data = json.dumps(payload or {}).encode("utf-8")
|
|
req = urllib.request.Request(
|
|
url,
|
|
data=data,
|
|
headers={"Content-Type": "application/json"},
|
|
method="POST",
|
|
)
|
|
try:
|
|
with urllib.request.urlopen(req, timeout=timeout) as resp:
|
|
return 200 <= resp.status < 300
|
|
except Exception as exc:
|
|
print(f"WARNING: POST {url} failed: {exc}", file=sys.stderr)
|
|
return False
|
|
|
|
|
|
def draw_skeleton(frame: np.ndarray, keypoints: list[list[float]], w: int, h: int):
|
|
"""Draw COCO skeleton overlay on a BGR frame."""
|
|
pts = []
|
|
for x, y in keypoints:
|
|
px, py = int(x * w), int(y * h)
|
|
pts.append((px, py))
|
|
cv2.circle(frame, (px, py), 4, (0, 255, 0), -1)
|
|
|
|
for i, j in COCO_BONES:
|
|
if i < len(pts) and j < len(pts):
|
|
cv2.line(frame, pts[i], pts[j], (0, 200, 255), 2)
|
|
|
|
|
|
# ---------------------------------------------------------------------------
|
|
# Main collection loop
|
|
# ---------------------------------------------------------------------------
|
|
|
|
def main():
|
|
parser = argparse.ArgumentParser(
|
|
description="Collect camera ground-truth keypoints for WiFi pose training (ADR-079)."
|
|
)
|
|
parser.add_argument(
|
|
"--server",
|
|
default="http://localhost:3000",
|
|
help="Sensing server URL (default: http://localhost:3000)",
|
|
)
|
|
parser.add_argument(
|
|
"--preview",
|
|
action="store_true",
|
|
help="Show live skeleton overlay window",
|
|
)
|
|
parser.add_argument(
|
|
"--duration",
|
|
type=int,
|
|
default=300,
|
|
help="Recording duration in seconds (default: 300)",
|
|
)
|
|
parser.add_argument(
|
|
"--camera",
|
|
type=int,
|
|
default=0,
|
|
help="Camera device index (default: 0)",
|
|
)
|
|
parser.add_argument(
|
|
"--output",
|
|
default="data/ground-truth",
|
|
help="Output directory (default: data/ground-truth)",
|
|
)
|
|
parser.add_argument(
|
|
"--calibration",
|
|
default=None,
|
|
help="Camera-room calibration bundle JSON from scripts/calibrate-camera-room.py "
|
|
"(ADR-152 S2.1.3); adds room-frame keypoint rays + transceiver geometry "
|
|
"to every record",
|
|
)
|
|
args = parser.parse_args()
|
|
|
|
if not args.calibration:
|
|
print(
|
|
"WARNING: no --calibration bundle; labels stay in raw camera coordinates "
|
|
"and are layout-brittle (coordinate overfitting, ADR-152 S2.1.3) -- run "
|
|
"scripts/calibrate-camera-room.py first.",
|
|
file=sys.stderr,
|
|
)
|
|
|
|
# --- Resolve paths relative to repo root ---
|
|
repo_root = Path(__file__).resolve().parent.parent
|
|
output_dir = repo_root / args.output
|
|
output_dir.mkdir(parents=True, exist_ok=True)
|
|
cache_dir = repo_root / "data" / ".cache"
|
|
|
|
# --- Download / locate model ---
|
|
model_path = ensure_model(cache_dir)
|
|
|
|
# --- Open camera ---
|
|
cap = cv2.VideoCapture(args.camera)
|
|
if not cap.isOpened():
|
|
print(
|
|
f"ERROR: Cannot open camera index {args.camera}. "
|
|
"Check that a webcam is connected and not in use by another app.",
|
|
file=sys.stderr,
|
|
)
|
|
sys.exit(1)
|
|
|
|
frame_w = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
|
|
frame_h = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
|
|
print(f"Camera opened: {frame_w}x{frame_h}")
|
|
|
|
# --- Load calibration bundle (ADR-152 S2.1.3) ---
|
|
calib_ctx = None
|
|
if args.calibration:
|
|
# Lazy import keeps the no-calibration path identical to the original.
|
|
sys.path.insert(0, str(Path(__file__).resolve().parent))
|
|
import calibration_lib
|
|
|
|
try:
|
|
calib_ctx = calibration_lib.load_calibration_context(
|
|
Path(args.calibration), frame_w, frame_h
|
|
)
|
|
except (OSError, ValueError, json.JSONDecodeError) as exc:
|
|
print(f"ERROR: Cannot load calibration bundle {args.calibration}: {exc}",
|
|
file=sys.stderr)
|
|
sys.exit(1)
|
|
n_nodes = len(calib_ctx.transceiver_geometry.get("nodes", []))
|
|
print(f"Calibration: {calib_ctx.calibration_id[:23]}... "
|
|
f"({n_nodes} transceiver node(s)); emitting room-frame keypoint rays")
|
|
|
|
# --- Create PoseLandmarker ---
|
|
options = PoseLandmarkerOptions(
|
|
base_options=BaseOptions(model_asset_path=str(model_path)),
|
|
running_mode=RunningMode.IMAGE,
|
|
num_poses=1,
|
|
min_pose_detection_confidence=0.5,
|
|
min_pose_presence_confidence=0.5,
|
|
min_tracking_confidence=0.5,
|
|
)
|
|
landmarker = PoseLandmarker.create_from_options(options)
|
|
|
|
# --- Output file ---
|
|
timestamp_str = datetime.now().strftime("%Y%m%d_%H%M%S")
|
|
out_path = output_dir / f"keypoints_{timestamp_str}.jsonl"
|
|
out_file = open(out_path, "w", encoding="utf-8")
|
|
print(f"Output: {out_path}")
|
|
|
|
# --- Start CSI recording ---
|
|
recording_url_start = f"{args.server}/api/v1/recording/start"
|
|
recording_url_stop = f"{args.server}/api/v1/recording/stop"
|
|
csi_started = post_json(recording_url_start)
|
|
if csi_started:
|
|
print("CSI recording started on sensing server.")
|
|
else:
|
|
print(
|
|
"WARNING: Could not start CSI recording. "
|
|
"Camera keypoints will still be captured.",
|
|
file=sys.stderr,
|
|
)
|
|
|
|
# --- Graceful shutdown ---
|
|
shutdown_requested = False
|
|
|
|
def _handle_signal(signum, frame):
|
|
nonlocal shutdown_requested
|
|
shutdown_requested = True
|
|
|
|
signal.signal(signal.SIGINT, _handle_signal)
|
|
signal.signal(signal.SIGTERM, _handle_signal)
|
|
|
|
# --- Collection loop ---
|
|
start_time = time.monotonic()
|
|
frame_count = 0
|
|
total_confidence = 0.0
|
|
total_visible = 0
|
|
|
|
print(f"Collecting for {args.duration}s ... (press 'q' in preview to stop)")
|
|
|
|
try:
|
|
while not shutdown_requested:
|
|
elapsed = time.monotonic() - start_time
|
|
if elapsed >= args.duration:
|
|
break
|
|
|
|
ret, frame = cap.read()
|
|
if not ret:
|
|
print("WARNING: Failed to read frame, retrying ...", file=sys.stderr)
|
|
time.sleep(0.01)
|
|
continue
|
|
|
|
ts_ns = time.time_ns()
|
|
|
|
# Convert BGR -> RGB for MediaPipe
|
|
rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
|
|
mp_image = mp.Image(image_format=mp.ImageFormat.SRGB, data=rgb)
|
|
|
|
result = landmarker.detect(mp_image)
|
|
|
|
n_persons = len(result.pose_landmarks)
|
|
|
|
if n_persons > 0:
|
|
landmarks = result.pose_landmarks[0]
|
|
keypoints = []
|
|
visibilities = []
|
|
for coco_idx in range(17):
|
|
mp_idx = MP_TO_COCO[coco_idx]
|
|
lm = landmarks[mp_idx]
|
|
keypoints.append([round(lm.x, 5), round(lm.y, 5)])
|
|
visibilities.append(lm.visibility if lm.visibility else 0.0)
|
|
|
|
confidence = float(np.mean(visibilities))
|
|
n_visible = int(sum(1 for v in visibilities if v > 0.5))
|
|
else:
|
|
keypoints = []
|
|
confidence = 0.0
|
|
n_visible = 0
|
|
|
|
record = {
|
|
"ts_ns": ts_ns,
|
|
"keypoints": keypoints,
|
|
"confidence": round(confidence, 4),
|
|
"n_visible": n_visible,
|
|
"n_persons": n_persons,
|
|
}
|
|
if calib_ctx is not None:
|
|
# Adds keypoints_room (bearing rays), camera_origin_room,
|
|
# calibration_id, transceiver_geometry (ADR-152 S2.1.3).
|
|
record = calibration_lib.augment_record(record, calib_ctx)
|
|
out_file.write(json.dumps(record) + "\n")
|
|
frame_count += 1
|
|
total_confidence += confidence
|
|
total_visible += n_visible
|
|
|
|
# Preview overlay
|
|
if args.preview and keypoints:
|
|
draw_skeleton(frame, keypoints, frame_w, frame_h)
|
|
|
|
if args.preview:
|
|
remaining = max(0, int(args.duration - elapsed))
|
|
cv2.putText(
|
|
frame,
|
|
f"Frames: {frame_count} Visible: {n_visible}/17 Time: {remaining}s",
|
|
(10, 30),
|
|
cv2.FONT_HERSHEY_SIMPLEX,
|
|
0.7,
|
|
(255, 255, 255),
|
|
2,
|
|
)
|
|
cv2.imshow("Ground Truth Collection (ADR-079)", frame)
|
|
if cv2.waitKey(1) & 0xFF == ord("q"):
|
|
break
|
|
|
|
finally:
|
|
# --- Cleanup ---
|
|
out_file.close()
|
|
cap.release()
|
|
if args.preview:
|
|
cv2.destroyAllWindows()
|
|
landmarker.close()
|
|
|
|
# Stop CSI recording
|
|
if csi_started:
|
|
if post_json(recording_url_stop):
|
|
print("CSI recording stopped.")
|
|
else:
|
|
print("WARNING: Failed to stop CSI recording.", file=sys.stderr)
|
|
|
|
# --- Summary ---
|
|
avg_conf = total_confidence / frame_count if frame_count > 0 else 0.0
|
|
avg_vis = total_visible / frame_count if frame_count > 0 else 0.0
|
|
print()
|
|
print("=== Collection Summary ===")
|
|
print(f" Total frames: {frame_count}")
|
|
print(f" Avg confidence: {avg_conf:.3f}")
|
|
print(f" Avg visible joints: {avg_vis:.1f} / 17")
|
|
print(f" Output: {out_path}")
|
|
|
|
|
|
if __name__ == "__main__":
|
|
main()
|