The terminal chat application is a basic example of how to use the harmony format together with the PyTorch, Triton, and vLLM implementations. Along with the model, we are also releasing harmony, a new chat format library for interacting with the model. This implementation is not production-ready, but it is accurate to the PyTorch implementation. Download […]