Collections

Collections including paper arxiv:2403.16971

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 64

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 64
AutoDev: Automated AI-Driven Development

Paper • 2403.08299 • Published Mar 13 • 1

Papers - Agent - Operating Systems

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 64
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11 • 43

Papers - Agent - Memory

Cognitive Architectures for Language Agents

Paper • 2309.02427 • Published Sep 5, 2023 • 2
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 64

Papers - Agent - Architecture

Cognitive Architectures for Language Agents

Paper • 2309.02427 • Published Sep 5, 2023 • 2
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 64
Scaling Instructable Agents Across Many Simulated Worlds

Paper • 2404.10179 • Published Mar 13 • 25

To read... eventually

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 123
Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19 • 49
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

Paper • 2402.03766 • Published Feb 6 • 12
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 64

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Paper • 2403.07816 • Published Mar 12 • 39
microsoft/phi-1_5

Text Generation • Updated Apr 29 • 93.2k • 1.31k
Language models scale reliably with over-training and on downstream tasks

Paper • 2403.08540 • Published Mar 13 • 14
Akashpb13/Swahili_xlsr

Automatic Speech Recognition • Updated Aug 27, 2023 • 354 • 8

SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6 • 74
Character-LLM: A Trainable Agent for Role-Playing

Paper • 2310.10158 • Published Oct 16, 2023 • 1
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 64
RakutenAI-7B: Extending Large Language Models for Japanese

Paper • 2403.15484 • Published Mar 21 • 12

ibm/AttaQ

Viewer • Updated Jan 26 • 1.4k • 1.2k • 10
snorkelai/snorkel-curated-instruction-tuning

Preview • Updated Mar 11 • 2 • 9
corbyrosset/researchy_questions

Viewer • Updated Feb 29 • 96.4k • 56 • 24
argilla/ultrafeedback-binarized-preferences

Viewer • Updated Nov 30, 2023 • 63.6k • 647 • 64

Papers - Training Research

Measuring the Effects of Data Parallelism on Neural Network Training

Paper • 1811.03600 • Published Nov 8, 2018 • 2
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost

Paper • 1804.04235 • Published Apr 11, 2018 • 2
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

Paper • 1905.11946 • Published May 28, 2019 • 3
Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7 • 61
