Public Dataset Scoreboard¶

bagx scores public rosbag datasets with a single command (bagx eval). This page is generated from benchmarks/scoreboard.json so scores stay reproducible across releases.

Scores are readiness hints, not SLAM trajectory accuracy. They summarize sensor rates, sync, IMU noise, and stack-specific topic coverage (Nav2 / Autoware / MoveIt / perception).

Analysis articles¶

30 public datasets tracked · 26 scored · bagx 0.6.0

Dataset	Domain	ROS	Overall	IMU	Sync	Key finding	bagx
EuRoC MH_01_easy	SLAM	ros1	60.3	23.1	97.6	WARNING IMU accel noise 0.7935 m/s² — noisy, LiDAR-only odometry may outperform LIO	0.6.0
Hilti SLAM Challenge 2022	SLAM	ros2	—	—	—	Pending download — construction-site LiDAR+IMU benchmark	—
Livox MID-360	SLAM	ros2	83.7	97.3	70.0	IMU is excellent but LiDAR↔IMU sync ~25ms — enable deskew	0.4.0
M2DGR (urban street)	SLAM	ros1	—	—	—	Pending download (~21GB) — multi-sensor GNSS/IMU/LiDAR urban dataset	—
NTU VIRAL (drone)	SLAM	ros2	79.8	59.5	100.0	Noisy IMU — LiDAR-only SLAM (e.g. KISS-ICP) recommended	0.4.0
Newer College (handheld)	SLAM	ros2	92.3	84.5	100.0	LiDAR+IMU quality is strong enough for SLAM benchmarking	0.4.0
Ouster OS0-32 (static)	SLAM	ros2	76.6	79.6	73.6	50Hz IMU is slow for deskew; sync margin is thin	0.4.0
TUM VI calib-imu1	SLAM	ros1	95.4	90.8	100.0	IMU accel noise 0.0515 m/s² — good for LIO, set imu_acc_noise_density to 0.0515	0.6.0
UrbanLoco (Alameda)	SLAM	ros1	—	—	—	Pending download (multi-GB) — long urban GNSS/IMU/LiDAR sequence	—
UrbanNav HK TST	SLAM	ros1	—	—	—	Pending download (~35GB) — urban canyon GNSS + LiDAR fusion bag	—
AutoCore Ouster OS1-64	Autoware	ros2	100.0	—	—	LiDAR (/sensing/lidar/top/rectified/pointcloud) at 10Hz	0.6.0
Autoware all-sensors-bag1	Autoware	ros2	90.2	85.8	75.1	GNSS fix rate 100% — suitable as ground truth reference	0.6.0
Autoware all-sensors-bag2	Autoware	ros2	86.1	69.8	74.6	GNSS fix rate 100% — suitable as ground truth reference	0.6.0
Autoware all-sensors-bag3	Autoware	ros2	81.7	78.7	59.0	GNSS fix rate 100% — suitable as ground truth reference	0.6.0
Autoware all-sensors-bag4	Autoware	ros2	72.2	—	—	Sensing/localization-only bag — planning/control topics absent	0.4.0
Autoware all-sensors-bag5	Autoware	ros2	84.5	77.9	64.5	GNSS fix rate 100% — suitable as ground truth reference	0.6.0
Autoware all-sensors-bag6	Autoware	ros2	77.0	40.3	69.9	GNSS fix rate 100% — suitable as ground truth reference	0.6.0
Autoware driving_20_kmh	Autoware	ros2	99.0	—	—	LiDAR packets and /vehicle/status/velocity_status at healthy rates	0.4.0
Autoware driving_30_kmh	Autoware	ros2	98.9	—	97.9	Sensor sync good (8.4ms)	0.6.0
Nav2 Gazebo (deep capture)	Nav2	ros2	100.0	100.0	—	Pipeline scan → costmap: 43ms median, 82ms P95	0.6.0
Nav2 Gazebo (goal capture)	Nav2	ros2	88.6	100.0	77.1	IMU accel noise 0.0170 m/s² — excellent, set imu_acc_noise_density to 0.0170	0.6.0
Nav2 Gazebo (headless)	Nav2	ros2	100.0	100.0	—	Control command (/cmd_vel) at 10Hz — good for control-loop observability	0.6.0
TurtleBot3 Walker	Nav2	ros2	100.0	100.0	—	Pipeline scan → costmap: 43ms median, 82ms P95	0.6.0
Franka Panda (MoveIt)	MoveIt	ros2	100.0	—	—	Pipeline joint_states → planned_path: 6ms median, 6ms P95 (1 sample)	0.6.0
MoveIt execution capture	MoveIt	ros2	100.0	—	—	Pipeline joint_states → planned_path: 6ms median, 6ms P95 (1 sample)	0.6.0
MoveIt plan-only capture	MoveIt	ros2	66.7	—	—	Pipeline joint_states → planned_path: 9ms median, 9ms P95 (1 sample)	0.6.0
NVIDIA r2b_galileo	Perception	ros2	90.5	—	—	Eight camera streams + IMU + chassis odom time-aligned	0.4.0
NVIDIA r2b_galileo2	Perception	ros2	95.3	—	—	RGB-D + infra + camera_info — reusable for perception export	0.4.0
NVIDIA r2b_whitetunnel	Perception	ros2	91.3	—	—	Multi-camera + IMU; noisy accel flagged but perception coverage solid	0.4.0
NVIDIA r2b_robotarm	Manipulation	ros2	96.0	—	—	RGB-D + joint_states suitable for arm perception benchmarking	0.4.0

Reproduce a row¶

Each manifest entry includes a reproduce command. Example:

export BAGX_SCOREBOARD_BAGS=/path/to/downloaded/bags
# also accepts BAGX_REALBAGS for NVIDIA / Autoware mirrors
bagx eval "$BAGX_SCOREBOARD_BAGS/r2b_galileo2"

Refresh the table after downloading bags:

export BAGX_SCOREBOARD_BAGS=/path/to/bags
python scripts/generate_scoreboard.py --refresh --write-manifest
python scripts/generate_scoreboard.py

Manifest: benchmarks/scoreboard.json

Sample commands by domain¶

SLAM

Newer College (handheld): bagx eval newer_college.db3
Livox MID-360: bagx eval livox_mid360.db3

Autoware

Autoware all-sensors-bag4: aws s3 sync s3://autoware-files/recordings/bags/2022-08-22_leo_drive_isuzu_bags/all-sensors-bag4_compressed/ ./all-sensors-bag4_compressed --no-sign-request && bagx eval all-sensors-bag4_compressed
Autoware driving_20_kmh: aws s3 sync s3://autoware-files/recordings/bags/2022-08-22_leo_drive_isuzu_bags/driving_20_kmh_2022_06_10-16_01_55_compressed/ ./driving_20 --no-sign-request && bagx eval driving_20

Nav2

TurtleBot3 Walker: bagx eval turtlebot3_walker
Nav2 Gazebo (deep capture): python scripts/run_ros_dogfood.py nav2-gazebo && bagx eval <captured_bag>

MoveIt

Franka Panda (MoveIt): bagx eval panda
MoveIt execution capture: python scripts/run_ros_dogfood.py moveit-demo && bagx eval <captured_bag>

Perception

NVIDIA r2b_galileo: bagx eval r2b_galileo
NVIDIA r2b_whitetunnel: bagx eval r2b_whitetunnel

Manipulation

NVIDIA r2b_robotarm: bagx eval r2b_robotarm