Not off the top of my head, but there must be something. llama.cpp and vLLM have basically solved the inference problem for LLMs. What you need is a RAG solution on top that also combines retrieval with web search.
For coding tasks you need web search and RAG. Model size is not what matters, since even the largest models end up finding solutions online.
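To make that concrete, here is a rough sketch of the "RAG plus web search" layer I mean. It is a toy, not a real implementation: the retrieval is plain word overlap rather than embeddings, and `web_search` is a hypothetical stub you would swap for whatever search API you actually use. The resulting prompt would then go to your local llama.cpp or vLLM server; that part is left out here.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9_]+", text.lower())


def score(query, doc):
    """Count overlapping tokens between query and doc (toy relevance score)."""
    q = Counter(tokenize(query))
    d = Counter(tokenize(doc))
    return sum(min(q[t], d[t]) for t in q)


def retrieve(query, docs, k=2):
    """Return up to k local docs that share at least one token with the query."""
    ranked = sorted(docs, key=lambda doc: score(query, doc), reverse=True)
    return [doc for doc in ranked[:k] if score(query, doc) > 0]


def web_search(query):
    """Hypothetical stub: replace with a call to a real search API."""
    return []


def build_prompt(question, local_docs, search=web_search, k=2):
    """Merge local retrieval and web results into a single prompt string."""
    context = retrieve(question, local_docs, k) + search(question)[:k]
    lines = ["Answer using the context below.", ""]
    for i, snippet in enumerate(context, 1):
        lines.append(f"[{i}] {snippet}")
    lines += ["", f"Question: {question}"]
    return "\n".join(lines)
```

Usage would be something like `build_prompt("how do I run coroutines concurrently", my_docs)`, with `my_docs` being a list of strings from your own notes or codebase. The point is only that the model gets the relevant snippets handed to it instead of having to "know" them.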
This is a “The worst person you know just made a great point” moment, isn’t it?
Swearing in source code points to healthy, organic development.