Achieving an robust and universal semantic representation for action description remains the key challenge in natural more info language understanding. Current approaches often struggle to capture the subtlety of human actions, leading to inaccurate representations. To address this challenge, we propose innovative framework that leverages hybrid le