In AWS in Plain English, by Rustem Feyzkhanov: Serverless compute for LLM — with a step-by-step guide for hosting Mistral 7B on AWS Lambda. One of the challenges with using LLMs in production is finding the right way to host the models in the cloud. GPUs are expensive, so hosting… (Nov 16, 2023)
Intel(R) Neural Compressor: Highly-efficient LLM Inference on Intel Platforms. Leadership performance, yet compatible with llama.cpp. (Oct 20, 2023)
In Government Digital Products, Singapore, by Terence Lucas Yap: From Conventional RAG to Graph RAG. When Large Language Models Meet Knowledge Graphs. (Mar 16, 2024)
In Artificial Intelligence in Plain English, by AI TutorMaster: Hugging Face Models + Llama.cpp = Faster AI 🚀. How to Convert Models to GGUF Format. (Mar 17, 2024)
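A rough sketch of the conversion workflow that article covers: download a Hugging Face checkpoint and run it through llama.cpp's converter script. The model id, output path, and converter script name (which has changed across llama.cpp versions) are assumptions here, not details taken from the article.

```python
# Sketch only: convert a Hugging Face checkpoint to GGUF with llama.cpp's converter.
# Assumes a local clone of llama.cpp; the script has appeared as convert.py,
# convert-hf-to-gguf.py, and convert_hf_to_gguf.py depending on the version.
import subprocess
from huggingface_hub import snapshot_download

# Example checkpoint; swap in whatever model you want to convert.
model_dir = snapshot_download(repo_id="mistralai/Mistral-7B-Instruct-v0.2")

subprocess.run(
    [
        "python", "llama.cpp/convert_hf_to_gguf.py",  # adjust to your llama.cpp version
        model_dir,
        "--outfile", "mistral-7b-instruct.gguf",
        "--outtype", "f16",  # keep full precision here; quantize in a separate step
    ],
    check=True,
)
```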
In TDS Archive, by Maxime Labonne: Merge Large Language Models with mergekit. Create your own models easily, no GPU required! (Jan 8, 2024)
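For orientation on the mergekit workflow: the tool is driven by a YAML config plus a CLI call. The sketch below writes a generic SLERP config and invokes mergekit-yaml; the model names, layer ranges, and interpolation value are illustrative placeholders, not the article's recipe.

```python
# Sketch only: drive mergekit by writing a config file and calling its CLI.
# Model names, layer ranges, and parameter values are illustrative placeholders.
import subprocess
from pathlib import Path

config = """\
slices:
  - sources:
      - model: mistralai/Mistral-7B-v0.1
        layer_range: [0, 32]
      - model: HuggingFaceH4/zephyr-7b-beta
        layer_range: [0, 32]
merge_method: slerp
base_model: mistralai/Mistral-7B-v0.1
parameters:
  t: 0.5
dtype: bfloat16
"""

Path("merge-config.yaml").write_text(config)
subprocess.run(
    ["mergekit-yaml", "merge-config.yaml", "./merged-model", "--copy-tokenizer"],
    check=True,
)
```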
Peter Stevens: Run an LLM on Apple Silicon Mac using llama.cpp. Explore how to configure and experiment with large language models in your local environment. (Dec 27, 2023)
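The article works with the llama.cpp CLI; as a rough companion sketch, the same idea through the llama-cpp-python bindings looks roughly like this. The model path and generation settings are placeholders, and Metal offload assumes the package was built with Metal support.

```python
# Sketch only: run a local GGUF model with llama-cpp-python.
# On Apple Silicon, n_gpu_layers=-1 offloads all layers to Metal if the wheel supports it.
from llama_cpp import Llama

llm = Llama(
    model_path="./mistral-7b-instruct.gguf",  # placeholder path to a GGUF file
    n_ctx=4096,        # context window
    n_gpu_layers=-1,   # offload everything to the GPU (Metal on macOS)
)

out = llm(
    "Q: Explain what a GGUF file is in one sentence. A:",
    max_tokens=128,
    temperature=0.7,
)
print(out["choices"][0]["text"])
```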
Ingrid Stevens: Quantization of LLMs with llama.cpp. Understanding and Implementing n-bit Quantization Techniques for Efficient Inference in LLMs. (Mar 15, 2024)
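A rough sketch of the kind of n-bit quantization step that article walks through: llama.cpp ships a quantize tool that rewrites an f16 GGUF into a lower-bit variant. The binary name (quantize in older builds, llama-quantize in newer ones) and the file names here are assumptions.

```python
# Sketch only: quantize an f16 GGUF to 4-bit with llama.cpp's quantize tool.
# The binary is "quantize" in older builds and "llama-quantize" in newer ones.
import subprocess

subprocess.run(
    [
        "./llama.cpp/llama-quantize",       # adjust path/name for your build
        "mistral-7b-instruct.gguf",         # f16 input produced by the converter
        "mistral-7b-instruct.Q4_K_M.gguf",  # quantized output
        "Q4_K_M",                           # a common quality/size trade-off
    ],
    check=True,
)
```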
In TDS Archive, by Ian Ho: Automated Prompt Engineering. A mixture of reflections, lit reviews, and an experiment (just for fun) on Automated Prompt Engineering for Large Language Models. (Mar 10, 2024)
In Superteams.ai, by Akriti Upadhyay: How to Build an Advanced AI-Powered Enterprise Content Pipeline Using Mixtral 8x7B and Qdrant. (Feb 19, 2024)
In WhyHow.AI, by Chia Jeng Yang: Why Gemini 1.5 (and other large context models) are bullish for RAG. Optimization via RAG: How to overcome Accuracy, Cost, Latency and other performance limitations of large context models. (Feb 18, 2024)
Yaduvanshiharsh: Build Your Own Chatbot Assistant in 2 Minutes with Hugging Face. Hugging Face has introduced Assistant, built on top of Hugging Chat. With this platform, you can create your own chatbot assistant in under… (Feb 17, 2024)
In Level Up Coding, by Youssef Hosni: 14 Free Large Language Models Fine-Tuning Notebooks. Getting Started with LLM Fine-Tuning through These Free Colab Notebooks. (Feb 6, 2024)
Dr. Ernesto Lee: Implementing AI in Software Testing: Creating a Text Generation Model for Test Automation. (Feb 6, 2024)
In Level Up Coding, by Fareed Khan: Free GenAI APIs You Can Use in 2024. Exploring the Latest Free GenAI APIs. (Feb 4, 2024)
In TDS Archive, by Anthony Alcaraz: Enhanced Large Language Models as Reasoning Engines. The recent exponential advances in natural language processing capabilities from large language models (LLMs) have stirred tremendous… (Dec 23, 2023)
Aaweg I: Improving RAG: Self Querying Retrieval. KEEP IN TOUCH | THE GEN AI SERIES. (Feb 11, 2024)
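For orientation on what self-querying retrieval means in practice: an LLM rewrites the user question into a semantic query plus a structured metadata filter before hitting the vector store. The sketch below uses LangChain's SelfQueryRetriever with made-up documents and metadata fields; imports and signatures vary a bit across LangChain versions, and an OpenAI key is assumed only for illustration.

```python
# Sketch only: LangChain-style self-querying retrieval over made-up movie documents.
# The LLM turns "comedies released after 2010" into a semantic query plus a filter.
# Typically needs the `lark` package installed for query parsing.
from langchain_core.documents import Document
from langchain_community.vectorstores import Chroma
from langchain_openai import ChatOpenAI, OpenAIEmbeddings
from langchain.chains.query_constructor.base import AttributeInfo
from langchain.retrievers.self_query.base import SelfQueryRetriever

documents = [
    Document(page_content="A wisecracking robot befriends a lonely kid.",
             metadata={"genre": "comedy", "year": 2014}),
    Document(page_content="A detective hunts a serial killer in the rain.",
             metadata={"genre": "thriller", "year": 1995}),
]
vectorstore = Chroma.from_documents(documents, OpenAIEmbeddings())

metadata_field_info = [
    AttributeInfo(name="genre", description="Genre of the movie", type="string"),
    AttributeInfo(name="year", description="Release year", type="integer"),
]

retriever = SelfQueryRetriever.from_llm(
    ChatOpenAI(temperature=0),
    vectorstore,
    "Brief plot summary of a movie",  # description of the document contents
    metadata_field_info,
)

results = retriever.get_relevant_documents("comedies released after 2010")
```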
Sean Ryan: Extracting website CSS styling from a screenshot via the LLaVA LLM and prompt engineering. The LLaVA LLM is a Large Language Model that can accept images as input. This opens up many possibilities… (Feb 11, 2024)
In Generative AI, by Fabio Matricardi: Stop! Don’t read this until you get your LLM Under Control. Learn crucial “stop words” to avoid information overload and unlock focused conversations with your Large Language Model. (Feb 10, 2024)
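The "stop words" that piece refers to are stop sequences: strings that tell the runtime to cut generation off instead of letting the model ramble into the next turn. A minimal, hedged illustration with llama-cpp-python follows; the model path and the chosen stop strings are placeholders, since the right ones depend on your model's prompt template.

```python
# Sketch only: use stop sequences so generation ends cleanly instead of the model
# impersonating the next speaker. Stop strings depend on your prompt template.
from llama_cpp import Llama

llm = Llama(model_path="./mistral-7b-instruct.gguf", n_ctx=2048)

out = llm(
    "User: Give me one tip for writing good prompts.\nAssistant:",
    max_tokens=200,
    stop=["User:", "\n\n"],  # cut off before the model starts a new turn
)
print(out["choices"][0]["text"].strip())
```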
In Level Up Coding, by Gao Dalie (高達烈): LangGraph + Gemini Pro + Custom Tool + Streamlit = Multi-Agent Application Development. In this post, you are going to learn how to create this chatbot with LangGraph, Gemini Pro (or any model you like), a custom function, and… (Feb 4, 2024)
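To give a flavor of the LangGraph side of that stack, here is a minimal two-node graph sharing a typed state. The Gemini call is replaced by a placeholder function, and the node names and state fields are made up for illustration; they are not the article's code.

```python
# Sketch only: a tiny LangGraph graph with two nodes sharing a typed state.
# call_model is a stand-in for a real LLM call (e.g. Gemini Pro via its SDK).
from typing import TypedDict

from langgraph.graph import StateGraph, END


class ChatState(TypedDict):
    question: str
    answer: str


def call_model(state: ChatState) -> dict:
    # Placeholder for an actual model call; returns a partial state update.
    return {"answer": f"(model answer to: {state['question']})"}


def postprocess(state: ChatState) -> dict:
    return {"answer": state["answer"].strip()}


graph = StateGraph(ChatState)
graph.add_node("call_model", call_model)
graph.add_node("postprocess", postprocess)
graph.set_entry_point("call_model")
graph.add_edge("call_model", "postprocess")
graph.add_edge("postprocess", END)

app = graph.compile()
print(app.invoke({"question": "What does LangGraph add on top of LangChain?"}))
```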