Router Ideas

Notation:

  • Tool -> what is orchestrated by the router
  • Router -> Large Language Models (LLMs) (does is have to be one?) that orchestrate
  • Agent = router in most cases

LLamaIndex:

  • tools have descriptions used by the agent to decide which to pick
  • inputs and outputs are strings
  • indexes, retrievers, response synthesis, and query engines can be used on top of the proprietary data.
  • we can use more than one tool at the same time (e.g. semantic search + summarization -> Colab)
  • What about memory? -> combine with LangChain (tutorial)
  • LlamaDebugHandler tracks events and timing stats

ReAct - Synergizing reasoning and acting in language models:

  • Solving process demonstrated by ReAct is more factual and grounded, whereas CoT is more accurate in formulating reasoning structure but can easily suffer from hallucinated facts or thoughts;
  • Best version is given by ReAct+CoT and external knowledge;
  • tested with GPT-3 and PaLM, what about other models?
  • How does fine-tuning works? Is it necessary?

LangChain:

  • it helps with instantiation of models, memory, agents, chains;
  • prompts: we can use template (even ones that have already been written) and few shot examples;
  • we can use indexes (with retriever)
  • we can define custom tool (and a tool can be another LLM)