Local Models Related Links

This is just a backup from Local Models Related Links, it's based from this post: https://boards.4channel.org/g/thread/93323225#p93328754 It will not be updated.

/lmg/	Accelerate
Guides
LLaMa CPU/GPU guide	Entry guide for Nvidia GPU inferencing and general CPU inferencing
oobabooga ROCm Installation	AMD GPU inferencing
Example Fine Tune Walkthrough	Shows use of custom dataset and how to use it to fine tune a model
Example LoRa Walkthrough	Huggingface's StackLLaMa with their lora_config settings
4-bit LoRA Training Notebook	LoRA tune on Colab
Anon's LLaMa roleplay guide	For longer outputs more conducive to roleplay in TavernAI
Models
Huggingface	Generally the best place to find models. Link is for LLaMa currently
Curated Models Rentry	Overview of various models with links to various quantizations of them
Bellard's TS Server	Fabrice Bellard hosts a server with open models and a closed source way to run them
The-Eye	File host site that has a random assortment of ML resources
Papers
Local Models Papers Rentry	Other /lmg/ resource I keep up to date with new papers and articles
LabML.AI	Best way to find newly published papers
PapersWithCode	Good for catching trending papers based off Github stars
News
AI Explained	General AI news with well sourced links (Youtube)
Dr Alan D Thompson	Model reviews and AGI insights (Youtube)
Don't Worry About the Vase	Lesswrong cultist so prepare for "AI Bad" takes but does a good weekly AI news roundup (Blog)
SD Compendium	Stable Diffusion focused content with somewhat updated news (Wiki)
Info
Models Table	Google Sheet of models/major AI labs/other LLM information by Alan Thompson
Which GPU(s) to Get for Deep Learning	Tim Dettmer's continually updated blogpost
GPU inferencing Web UI Benchmarks	Outdated by now but still useful from Tom's Hardware
ML Glossary	From Google
List of Frameworks	Mostly for training Models from scratch. Maybe we'll get there someday
Andre Karpathy Videos	Former Tesla lead for AI (now at OpenAI). Builds models with explanation
Thread Template	Also has further resources and information
Previous Threads	Always good to search for previous questions before asking
Learn
The Principles of Deep Learning Theory	Give it a read even if you aren't sufficient with your math so you can get a feel of what is happening
Pen and Paper Exercises in Machine Learning	Do your homework
Huggingface NLP Course	Make sure to look at the other courses as well
Google's ML Course	Various courses related to ML
AttentionViz	Interactive tool that visualizes global attention patterns for transformer models
Diffusion Explainer	Interactive tool that explains how SD transforms text into images
Prompting
Prompt Engineering	Guide and current research on prompting by OpenAI's tech lead
OpenAI's Promptbook	ChatGPT/GPT-4 focused
LearnPrompting.org	Course and resources for prompting
PromptingGuide.Ai	Course and resources for prompting
Alpaca's Instruction	Image of the root verbs and objects for Alpaca specifically.
RPBT Prompt	Allows for OOC dialogue and for the bot to play as different NPCs
GPU Gits
Text Generation WebUI	Main GPU-based inferencing with extension support
Text Gen Extensions	Wiki link. Said wiki in general is excellent
TavernAI GPU Inferencing	Heavily modified TavernAI fork with WebUI API support
WebUI Context Hack	Forces a GC every 8 tokens in streaming mode
CPU Gits
llama.cpp	Main CPU-based inferencing
kobold.cpp	llama.cpp fork with Kobold UI
gpt-llama.cpp	llama.cpp fork that also replaces OpenAi's GPT APIs
Serge	llama.cpp chat interface. SvelteKit frontend, MongoDB
Alpaca Electron	llama.cpp chat interface.
Llama Server	llama.cpp Chat interface. Chatbot UI
Whisper.cpp	Speach-to-text CPU-based inferencing
Turbopilot	WIP. Copilot clone using llama.cpp to run Codegen 6B
Local Related Gits
AutoGPTQ	4bit weight quantization for bloom, gpt_neox(StableLM), gptj, llama and opt models
RPTQ for LLaMa	WIP implementation of weight+activation quantization
LLaMa Pruning	WIP. Various techniques to prune (zero weights) LLaMa models (needs post-training)
Basaran	OS alternative to the OpenAI text completion API
Langchain	Set of resources to maximize LLMs Chains/tool integrations/agents/etc.
Langchain Tutorials	Guide to get started and how to use. Youtube videos are also a good resource here
Local LLM Langchain	Experimental extension for WebUI with langchain support for notebook
LMQL	Query language for programming LLMs
LLaMa Index	Central interface to connect LLM's with external data
LLaMa Hub	Simple library of all the data loaders/readers for llama index/langchain
LLM Adapters	PEFT library adapters that work on LLaMA and other models
LMFlow	Similar as above
Alpaca LoRa 4bit	Should be best to use LoRa on the 4bit model in this case LLaMa
Rank Response from Human Feedback	Easier alignment tuning method
Shell GPT	Command-line productivity tool works though OpenAI API (local with Basaran)
Segment Anything WebUI	SAM webui (GPU inferenced). Georgi seems he might do a SAM.cpp
Bark with voice clone	Text-to-audio transformer based model with CPU/GPU inference
RVC	Retrieval based Voice Conversation model
AudioGPT	Suite of various audio related foundational models for use with a LLM (use basaran for local)
ComfyUI	Node based stable diffusion GUI
Vlad's SD WebUI fork	Fork of Automatic1111 stable diffusion webui with active development
Datasets
Huggingface	Best source for datasets
ShareGPT Unfiltered v4	Removed refusals, excessive unicode, excessive repeats
Evol Instruct Unfiltered	Removed refusals, blatant alignment, blanks
GPTeacher	Collection of modular datasets generated by GPT-4
GPT4 4 LLM	Alpaca style self-instruct technique using GPT4 also with chinese version
Music AI Voice	For use with RVC or SVC audio voice cloning
Wikipedia Embeddings	Done by Cohere. link is their blog with some suggested use cases
Coomer Forums Scrape Rentry	Raw RP/ERP/ELIT content

Local Models Related Links

Warning