In the era of loT (Internet of Things) we are surrounded by a plethora of al enabled devices that can transcribe images, video, audio, and sensors signals into text descriptions. When such transcriptions are captured in activity reports for monitoring, life logging and anomaly detectio