mrfakename (mrfakename)

upvoted 2 papers about 2 months ago

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15 • 52

Artist: Aesthetically Controllable Text-Driven Stylization without Training

Paper • 2407.15842 • Published Jul 22 • 13

upvoted 2 papers 4 months ago

Diffusion On Syntax Trees For Program Synthesis

Paper • 2405.20519 • Published May 30 • 1

"Teach AI How to Code": Using Large Language Models as Teachable Agents for Programming Education

Paper • 2309.14534 • Published Sep 25, 2023 • 2

upvoted a paper 5 months ago

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Paper • 2402.09844 • Published Feb 15 • 20

upvoted a collection 5 months ago

🎭 Avatars

Collection

The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 66 items • Updated 7 days ago • 74

upvoted an article 5 months ago

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Article • Published Jan 26, 2023 • 32

upvoted 2 papers 5 months ago

Better speech synthesis through scaling

Paper • 2305.07243 • Published May 12, 2023 • 5

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 250

upvoted 2 articles 5 months ago

Mixture of Depth is Vibe

Article • Published Apr 22 • 43

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

Article • Published Feb 27 • 29

upvoted 2 papers 5 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 83

Natural language guidance of high-fidelity text-to-speech with synthetic annotations

Paper • 2402.01912 • Published Feb 2 • 11

upvoted an article 5 months ago

Mixture of Experts Explained

Article • Published Dec 11, 2023 • 157

upvoted 6 papers 6 months ago

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 25

upvoted 9 papers 7 months ago

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 63

Beyond Language Models: Byte Models are Digital World Simulators

Paper • 2402.19155 • Published Feb 29 • 49

Nemotron-4 15B Technical Report

Paper • 2402.16819 • Published Feb 26 • 42

A Language Model's Guide Through Latent Space

Paper • 2402.14433 • Published Feb 22 • 1

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis

Paper • 2402.14797 • Published Feb 22 • 19

DiffiT: Diffusion Vision Transformers for Image Generation

Paper • 2312.02139 • Published Dec 4, 2023 • 13

BitDelta: Your Fine-Tune May Only Be Worth One Bit

Paper • 2402.10193 • Published Feb 15 • 17

GhostWriter: Augmenting Collaborative Human-AI Writing Experiences Through Personalization and Agency

Paper • 2402.08855 • Published Feb 13 • 9

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Paper • 2402.08093 • Published Feb 12 • 54

upvoted 2 papers 8 months ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 120

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Paper • 2401.12070 • Published Jan 22 • 42

upvoted a collection 8 months ago

AIM

Collection

AIM: Autoregressive Image Models • 5 items • Updated Jun 19 • 48

upvoted 2 papers 8 months ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 140

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Paper • 2312.09911 • Published Dec 15, 2023 • 52

upvoted a collection 9 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 211

upvoted 3 papers 9 months ago

SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Paper • 2312.15166 • Published Dec 23, 2023 • 56

VecFusion: Vector Font Generation with Diffusion

Paper • 2312.10540 • Published Dec 16, 2023 • 21

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Paper • 2306.07691 • Published Jun 13, 2023 • 4

upvoted 2 papers 10 months ago

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 47

ChatAnything: Facetime Chat with LLM-Enhanced Personas

Paper • 2311.06772 • Published Nov 12, 2023 • 34

upvoted 2 papers 11 months ago

Controllable Music Production with Diffusion Models and Guidance Gradients

Paper • 2311.00613 • Published Nov 1, 2023 • 24

CodeFusion: A Pre-trained Diffusion Model for Code Generation

Paper • 2310.17680 • Published Oct 26, 2023 • 69

upvoted a paper about 1 year ago

Learning to Model the World with Language

Paper • 2308.01399 • Published Jul 31, 2023 • 34

mrfakename PRO

Articles

TTS Arena: Benchmarking Text-to-Speech Models in the Wild
