Rustem Feyzkhanov, in AWS in Plain English: Guide for running Llama 2 using LLAMA.CPP on AWS Fargate. Step-by-step guide for deploying Llama 2 model to AWS using LLAMA.CPP as framework, Fargate for hardware and Copilot for deployment. (Oct 17, 2023)
Vithushan Sylvester: Harnessing Llama CPP for Efficient HTTP Server Deployment of LLMs. (Oct 17, 2023)
Maya Akim: How to Add ANY Model to Ollama. I have somewhat accidentally created a “smart” LLM with 7b parameters. (Mar 17, 2024)
Ingrid Stevens: Quantization of LLMs with llama.cpp. Understanding and Implementing n-bit Quantization Techniques for Efficient Inference in LLMs. (Mar 15, 2024)
Peter Stevens: Run an LLM on Apple Silicon Mac using llama.cpp. Explore how to configure and experiment with large language models in your local environment. (Dec 27, 2023)
AI TutorMaster, in Artificial Intelligence in Plain English: Hugging Face Models + Llama.cpp = Faster AI🚀 How to Convert Models to GGUF Format. (Mar 17, 2024)