Any time we are using an LLM internally, like to generate a summary, or to capture memory, this should be externalised and not in the code. This could be a custom system, or langfuse?