BlazePose 3D

Real-Time 3D Body Pose Estimation (Single Person)

RUN

BlazePose 3D estimates 33 body keypoints in real time from your webcam or an uploaded video, and renders a live interactive 3D skeleton alongside the 2D video overlay. All inference runs entirely in the browser using MediaPipe Pose (BlazePose) (WASM) — no data leaves your device.

⚠ Single-person only. BlazePose detects exactly one person per frame — the most prominent subject in the scene. If multiple people are present, only one will be tracked.

The tool produces two complementary outputs per frame:

The following 33 BlazePose keypoints are detected:

  1. Nose
  2. Left Eye (inner)
  3. Left Eye
  4. Left Eye (outer)
  5. Right Eye (inner)
  6. Right Eye
  7. Right Eye (outer)
  8. Left Ear
  9. Right Ear
  10. Mouth (left)
  11. Mouth (right)
  12. Left Shoulder
  13. Right Shoulder
  14. Left Elbow
  15. Right Elbow
  16. Left Wrist
  17. Right Wrist
  18. Left Pinky
  19. Right Pinky
  20. Left Index
  21. Right Index
  22. Left Thumb
  23. Right Thumb
  24. Left Hip
  25. Right Hip
  26. Left Knee
  27. Right Knee
  28. Left Ankle
  29. Right Ankle
  30. Left Heel
  31. Right Heel
  32. Left Foot Index
  33. Right Foot Index

The data is exported as a CSV file structured as follows:

frame timestamp tag nose_x nose_y nose_wx nose_wy nose_wz nose_vis
0 1700000000000 0 312.4 118.7 0.021 -0.453 -0.812 0.998

Each row represents one detected frame. For each of the 33 keypoints: _x / _y are pixel coordinates in the video frame; _wx / _wy / _wz are hip-centred world coordinates (metres); _vis is the visibility score (0–1). Keypoints below the threshold are exported as empty cells.

Features

Model info

Source Code