Research into adding persistent linear memory to frozen pretrained language models using Contextual Delta Distillation (CDD), without modifying original weig...