LLM Finetuning
Video Introduction Course Tutorial
/r/LocalLLaMA Yearly
📄 The creator of an uncensored local LLM posted here, WizardLM-7B-Uncensored, is being threatened and harassed on Hugging Face by a user named mdegans. Mdegans is trying to get him fired from Microsoft and his model removed from HF. He needs our support.
📄 OpenAI wants to crack down on open source LLMs, force through a government licensing system, and create a regulatory moat for themselves
📄 Microsoft makes new 1.3B coding LLM that outperforms all models on MBPP except GPT-4, reaches third place on HumanEval above GPT-3.5, and shows emergent properties
📄 Just put together a programming performance ranking for popular LLaMAs using the HumanEval+ Benchmark!
📄 Microsoft Research proposes new framework, LongMem, allowing for unlimited context length along with reduced GPU memory usage and faster inference speed. Code will be open-sourced
📄 The LLaMa publication is protected free speech under Bernstein v. United States - US Senators’ letter to Meta is entirely inappropriate – regulation of open source LLMs would be unconstitutional
📄 New quantization method AWQ outperforms GPTQ in 4-bit and 3-bit with 1.45x speedup and works with multimodal LLMs
📄 QLoRA: 4-bit finetuning of LLMs is here! With it comes Guanaco, a chatbot on a single GPU, achieving 99% ChatGPT performance on the Vicuna benchmark
📄 NTK-Aware Scaled RoPE allows LLaMA models to have extended (8k+) context size without any fine-tuning and minimal perplexity degradation.
📄 Having a 20 gig file that you can ask an offline computer almost any question in the world is amazing.
📄 New model just dropped: WizardCoder-15B-v1.0 model achieves 57.3 pass@1 on the HumanEval Benchmarks .. 22.3 points higher than the SOTA open-source Code LLMs.
📄 New quantization method SqueezeLLM allows for loseless compression for 3-bit and outperforms GPTQ and AWQ in both 3-bit and 4-bit. Quantized Vicuna and LLaMA models have been released.
Editor Recommended Sites
AI and Tech NewsBest Online AI Courses
Classic Writing Analysis
Tears of the Kingdom Roleplay
Idea Share: Share dev ideas with other developers, startup ideas, validation checking
Data Lineage: Cloud governance lineage and metadata catalog tooling for business and enterprise
Multi Cloud Tips: Tips on multicloud deployment from the experts
Faceted Search: Faceted search using taxonomies, ontologies and graph databases, vector databases.
Developer Key Takeaways: Dev lessons learned and best practice from todays top conference videos, courses and books