This is just a backup from Local Models Related Links, it's based from this post: https://boards.4channel.org/g/thread/93323225#p93328754 It will not be updated.
Local Models Related Links
/lmg/ | Accelerate |
---|---|
Guides | |
LLaMa CPU/GPU guide | Entry guide for Nvidia GPU inferencing and general CPU inferencing |
oobabooga ROCm Installation | AMD GPU inferencing |
Example Fine Tune Walkthrough | Shows use of custom dataset and how to use it to fine tune a model |
Example LoRa Walkthrough | Huggingface's StackLLaMa with their lora_config settings |
4-bit LoRA Training Notebook | LoRA tune on Colab |
Anon's LLaMa roleplay guide | For longer outputs more conducive to roleplay in TavernAI |
Models | |
Huggingface | Generally the best place to find models. Link is for LLaMa currently |
Curated Models Rentry | Overview of various models with links to various quantizations of them |
Bellard's TS Server | Fabrice Bellard hosts a server with open models and a closed source way to run them |
The-Eye | File host site that has a random assortment of ML resources |
Papers | |
Local Models Papers Rentry | Other /lmg/ resource I keep up to date with new papers and articles |
LabML.AI | Best way to find newly published papers |
PapersWithCode | Good for catching trending papers based off Github stars |
News | |
AI Explained | General AI news with well sourced links (Youtube) |
Dr Alan D Thompson | Model reviews and AGI insights (Youtube) |
Don't Worry About the Vase | Lesswrong cultist so prepare for "AI Bad" takes but does a good weekly AI news roundup (Blog) |
SD Compendium | Stable Diffusion focused content with somewhat updated news (Wiki) |
Info | |
Models Table | Google Sheet of models/major AI labs/other LLM information by Alan Thompson |
Which GPU(s) to Get for Deep Learning | Tim Dettmer's continually updated blogpost |
GPU inferencing Web UI Benchmarks | Outdated by now but still useful from Tom's Hardware |
ML Glossary | From Google |
List of Frameworks | Mostly for training Models from scratch. Maybe we'll get there someday |
Andre Karpathy Videos | Former Tesla lead for AI (now at OpenAI). Builds models with explanation |
Thread Template | Also has further resources and information |
Previous Threads | Always good to search for previous questions before asking |
Learn | |
The Principles of Deep Learning Theory | Give it a read even if you aren't sufficient with your math so you can get a feel of what is happening |
Pen and Paper Exercises in Machine Learning | Do your homework |
Huggingface NLP Course | Make sure to look at the other courses as well |
Google's ML Course | Various courses related to ML |
AttentionViz | Interactive tool that visualizes global attention patterns for transformer models |
Diffusion Explainer | Interactive tool that explains how SD transforms text into images |
Prompting | |
Prompt Engineering | Guide and current research on prompting by OpenAI's tech lead |
OpenAI's Promptbook | ChatGPT/GPT-4 focused |
LearnPrompting.org | Course and resources for prompting |
PromptingGuide.Ai | Course and resources for prompting |
Alpaca's Instruction | Image of the root verbs and objects for Alpaca specifically. |
RPBT Prompt | Allows for OOC dialogue and for the bot to play as different NPCs |
GPU Gits | |
Text Generation WebUI | Main GPU-based inferencing with extension support |
Text Gen Extensions | Wiki link. Said wiki in general is excellent |
TavernAI GPU Inferencing | Heavily modified TavernAI fork with WebUI API support |
WebUI Context Hack | Forces a GC every 8 tokens in streaming mode |
CPU Gits | |
llama.cpp | Main CPU-based inferencing |
kobold.cpp | llama.cpp fork with Kobold UI |
gpt-llama.cpp | llama.cpp fork that also replaces OpenAi's GPT APIs |
Serge | llama.cpp chat interface. SvelteKit frontend, MongoDB |
Alpaca Electron | llama.cpp chat interface. |
Llama Server | llama.cpp Chat interface. Chatbot UI |
Whisper.cpp | Speach-to-text CPU-based inferencing |
Turbopilot | WIP. Copilot clone using llama.cpp to run Codegen 6B |
Local Related Gits | |
AutoGPTQ | 4bit weight quantization for bloom, gpt_neox(StableLM), gptj, llama and opt models |
RPTQ for LLaMa | WIP implementation of weight+activation quantization |
LLaMa Pruning | WIP. Various techniques to prune (zero weights) LLaMa models (needs post-training) |
Basaran | OS alternative to the OpenAI text completion API |
Langchain | Set of resources to maximize LLMs Chains/tool integrations/agents/etc. |
Langchain Tutorials | Guide to get started and how to use. Youtube videos are also a good resource here |
Local LLM Langchain | Experimental extension for WebUI with langchain support for notebook |
LMQL | Query language for programming LLMs |
LLaMa Index | Central interface to connect LLM's with external data |
LLaMa Hub | Simple library of all the data loaders/readers for llama index/langchain |
LLM Adapters | PEFT library adapters that work on LLaMA and other models |
LMFlow | Similar as above |
Alpaca LoRa 4bit | Should be best to use LoRa on the 4bit model in this case LLaMa |
Rank Response from Human Feedback | Easier alignment tuning method |
Shell GPT | Command-line productivity tool works though OpenAI API (local with Basaran) |
Segment Anything WebUI | SAM webui (GPU inferenced). Georgi seems he might do a SAM.cpp |
Bark with voice clone | Text-to-audio transformer based model with CPU/GPU inference |
RVC | Retrieval based Voice Conversation model |
AudioGPT | Suite of various audio related foundational models for use with a LLM (use basaran for local) |
ComfyUI | Node based stable diffusion GUI |
Vlad's SD WebUI fork | Fork of Automatic1111 stable diffusion webui with active development |
Datasets | |
Huggingface | Best source for datasets |
ShareGPT Unfiltered v4 | Removed refusals, excessive unicode, excessive repeats |
Evol Instruct Unfiltered | Removed refusals, blatant alignment, blanks |
GPTeacher | Collection of modular datasets generated by GPT-4 |
GPT4 4 LLM | Alpaca style self-instruct technique using GPT4 also with chinese version |
Music AI Voice | For use with RVC or SVC audio voice cloning |
Wikipedia Embeddings | Done by Cohere. link is their blog with some suggested use cases |
Coomer Forums Scrape Rentry | Raw RP/ERP/ELIT content |