List: Llama cpp | Curated by Sarin Suriyakoon | Medium

Sarin Suriyakoon

Mar 27, 2024

6 stories

Llama cpp

In

AWS in Plain English

by

Rustem Feyzkhanov

Guide for running Llama 2 using LLAMA.CPP on AWS Fargate

Step-by-step guide for deploying Llama 2 model to AWS using LLAMA.CPP as framework, Fargate for hardware and Copilot for deployment.

Oct 17, 2023

Guide for running Llama 2 using LLAMA.CPP on AWS Fargate

Oct 17, 2023

Vithushan Sylvester

Harnessing Llama CPP for Efficient HTTP Server Deployment of LLMs

Image by author via DALL-E 3

Oct 17, 2023

Harnessing Llama CPP for Efficient HTTP Server Deployment of LLMs

Oct 17, 2023

Maya Akim

How to Add ANY Model to Ollama

I have somewhat accidentally created a “smart” LLM with 7b parameters.

Mar 17, 2024

How to Add ANY Model to Ollama

Mar 17, 2024

Ingrid Stevens

Quantization of LLMs with llama.cpp

Understanding and Implementing n-bit Quantization Techniques for Efficient Inference in LLMs

Mar 15, 2024

Quantization of LLMs with llama.cpp

Mar 15, 2024

Peter Stevens

Run an LLM on Apple Silicon Mac using llama.cpp

Explore how to configure and experiment with large language models in your local environment

Dec 27, 2023

Run an LLM on Apple Silicon Mac using llama.cpp

Dec 27, 2023

In

Artificial Intelligence in Plain English

by

AI TutorMaster

Hugging Face Models + Llama.cpp = Faster AI🚀

How to Convert Models to GGUF Format

Mar 17, 2024

Hugging Face Models + Llama.cpp = Faster AI🚀

Mar 17, 2024

Sarin Suriyakoon
266 Followers
Following
Data Science Collective
J Kriwit
PALO IT THAILAND
The Medium Blog
Bill Yuchen Lin, PhD
See all (335)

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams