Skip to content

Band-Aware Heuristics as Strong Baselines for RF Labeling

Introduction

In a recent paper, Benjamin James Gilbert proposes a simple yet effective set of band-aware labeling heuristics (BAR) for RF datasets. These rules map center frequency and spectral cues to labels like GSM, Wi-Fi, and GPS, providing a clean baseline for machine learning (ML) benchmarks.

Key Highlights

  • Transparent Rules: BAR uses frequency-rule sets that are easy to audit and reproduce, serving as a sanity check for dataset curation.
  • Competitive Performance: Despite its simplicity, BAR achieves a macro F1 score of 0.85, closely rivaling a reference ML classifier’s 0.90 F1.
  • Practical Utility: It acts as a strong fallback for underpowered models and helps diagnose issues like class leakage.

Methodology

The BAR approach relies on a first-hit rule set based on frequency bands and lightweight cues (e.g., bandwidth, hopping behavior). Rules are evaluated in order, with examples including Wi-Fi (2.4-2.5 GHz), GPS L1 (1.575-1.585 GHz), and LTE (700-3800 MHz).

Results

  • Overall F1: 0.87
  • Macro F1: 0.85
  • Per-class F1 scores: Wi-Fi (0.91), GSM (0.88), GPS (0.93), LTE (0.84), Bluetooth (0.86)

These metrics, auto-pulled from a single macros file, ensure consistency across the manuscript.

Takeaways

  • BAR provides a solid, transparent baseline for benchmarking ML across RF bands.
  • It highlights auditable failure modes, such as errors at band edges or with short dwell hoppers, serving as a curation signal.
  • The harness-first approach keeps all data in sync, preventing copy-paste errors.

Limitations

BAR is intentionally simple and band-centric, which can lead to false positives in dense urban settings or miss atypical bandwidths. However, it’s positioned as a baseline and curation tool, not a replacement for ML classifiers.

Conclusion

The release of BAR, along with its Python matcher script, offers a valuable resource for the RF community. It’s a stepping stone for more advanced ML models while ensuring transparency and reproducibility.

Leave a Reply

Your email address will not be published. Required fields are marked *