As mobile devices increasingly become focal points for advanced applications, edge computing presents a viable solution to their inherent computational limitations, particularly for deploying Large Language Models (LLMs). However, despite advances in edge computing, significant