LLM Modules: Knowledge Transfer from a Large to a Small Model Using Enhanced Cross-Attention | Research Publication | DSML Kazakhstan