Traditional single-modality sensing faces limitations in accuracy and capability, and its decoupled implementation with Communication Systems increases latency in bandwidth-constrained environments. Additionally, single-task-oriented sensing systems fail to address users' diverse deman