Smart Language Models Research Notes - doraithodla/notes GitHub Wiki
Smart language models (SLMs) are an alternative way of using Natural Language Processing, a sibling of the Large Language Models (LLMs) produced by companies like Google and OpenAI. LLMs have some powerful upsides – emergent capabilities; extensive general knowledge; and plausible, ‘human-sounding’ text – but there are a variety of engineering approaches, including SLMs, that can drive value in the enterprise.
Smart language models, composed of millions of parameters as opposed to billions, adopt this approach. They start with the business use-case and then work backwards to build a model that can complete that task with a high degree of accuracy, based on its comprehensive training in that field.