47 results for “topic:scaling-laws”
Minimal reproduction of OneRec
Scaling Data-Constrained Language Models
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
🔥🔥🔥 Latest Advances on Large Recommendation Models
[NeurIPS'24 Spotlight] Observational Scaling Laws
A toolkit for scaling law research ⚖
Dimensionless learning
PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models
Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"
RL on the qwen3-base family of models on gsm8k using verl: is there an RL power law on downstream tasks?
[ICLR 2026] AI-based scaling law discovery
[ICLR 2025] Official implementation of "Towards Neural Scaling Laws for Time Series Foundation Models"
Code for reproducing the large-scale pre-training and transfer-learning experiments from the paper "Effect of large-scale pre-training on full and few-shot transfer learning for natural and medical images" (https://arxiv.org/abs/2106.00116)
The Silence of Intelligence — A comprehensive analysis of Anthropic CEO Dario Amodei's philosophy on Scaling Laws, AI safety, and the future of humanity. An open-source book systematizing his thought and unraveling the essence of scaling laws and the future of AI.
[NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benchmarks.
[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang
Awesome-RL-Reasoning
A method for calculating scaling laws for LLMs from publicly available models (a minimal fitting sketch follows this list)
Code for "Scaling Laws for Language Transfer Learning"
A curated collection of NLP and LLM resources. Covers essential papers and blogs on Transformers, Reinforcement Learning (RLHF, DPO, GRPO), Mechanistic Interpretability, Scaling Laws, and MLSys.
[ACL 2025 Oral] Cuckoo: A Series of IE Free Riders Using LLM's Resources to Scale up Themselves.
Optimization and Scaling of Medium-Frequency Transformers
[ICLR 2026 Oral] Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Code for the CoNLL BabyLM workshop paper "Mini Minds: Exploring Bebeshka and Zlata Baby Models"
Long Context, Less Focus: A Scaling Gap in LLMs Revealed through Privacy and Personalization
🌹[ICML 2024] Selecting Large Language Model to Fine-tune via Rectified Scaling Law
Code for the ICML 2025 paper "How Do Large Language Monkeys Get Their Power (Laws)?" (a toy demonstration follows this list)
A high-performance Python library for simulating Diffusion-Limited Aggregation (DLA) with Numba JIT acceleration, parallel rendering, and automated fractal dimension analysis of dendritic growth patterns (a stripped-down sketch follows this list).
🔬 Implementation of agent coordination architectures and scaling principles from 'Towards a Science of Scaling Agent Systems' (arXiv:2512.08296). Research-backed multi-agent framework with benchmarks and validation.
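
A minimal sketch of the kind of fit the "calculating scaling laws from publicly available models" entry describes: assume a saturating power law L(N) = a·N^(-b) + c and fit it to (parameter count, loss) pairs. The data points below are illustrative placeholders, not real measurements, and the repo's actual method may differ.

```python
import numpy as np
from scipy.optimize import curve_fit

def scaling_law(n_params, a, b, c):
    # Saturating power law: loss falls with model size toward an irreducible floor c.
    return a * n_params ** (-b) + c

# Hypothetical (parameter count, validation loss) pairs; placeholders, not real data.
n = np.array([1.2e8, 3.5e8, 1.3e9, 6.7e9, 1.3e10])
loss = np.array([3.95, 3.62, 3.28, 2.95, 2.82])

# p0 starts the optimizer away from the degenerate region b <= 0.
(a, b, c), _ = curve_fit(scaling_law, n, loss, p0=(10.0, 0.1, 1.5), maxfev=10_000)
print(f"L(N) ~= {a:.2f} * N^(-{b:.3f}) + {c:.2f}")
print(f"extrapolated loss at 7e10 params: {scaling_law(7e10, a, b, c):.3f}")
```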
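A toy demonstration of the aggregation effect behind the "Large Language Monkeys" entry: each problem's pass@k is exponential in k, 1 - (1 - p)^k, yet averaging over problems whose single-attempt success probabilities are heavy-tailed near zero yields an aggregate failure rate that decays like a power of k. The Beta distribution here is an illustrative choice, not the paper's fitted model.

```python
import numpy as np

rng = np.random.default_rng(0)
# Per-problem single-attempt success probabilities; the Beta(0.15, 2) mass
# near zero is what produces the aggregate power law below.
p = rng.beta(0.15, 2.0, size=5_000)

ks = np.array([1, 2, 4, 8, 16, 32, 64, 128, 256])
# Per-problem pass@k = 1 - (1 - p)^k; coverage averages it over problems.
coverage = np.array([np.mean(1.0 - (1.0 - p) ** k) for k in ks])

# If 1 - coverage ~ k^(-alpha), these local log-log slopes approach -alpha
# (here alpha is the first Beta parameter, 0.15).
slopes = np.diff(np.log(1.0 - coverage)) / np.diff(np.log(ks))
for k, c in zip(ks, coverage):
    print(f"k={k:>3}  coverage={c:.3f}")
print("local log-log slopes of the failure rate:", np.round(slopes, 2))
```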
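And a stripped-down on-lattice DLA sketch for the dendritic-growth entry. It omits the repo's Numba JIT acceleration, parallel rendering, and fractal-dimension analysis; the launch and kill radii are arbitrary choices for the sketch.

```python
import numpy as np

def dla(size=201, n_particles=500, seed=0):
    """Naive on-lattice diffusion-limited aggregation."""
    rng = np.random.default_rng(seed)
    grid = np.zeros((size, size), dtype=bool)
    c = size // 2
    grid[c, c] = True                 # seed particle at the center
    r_max = 1.0                       # current cluster radius
    moves = ((1, 0), (-1, 0), (0, 1), (0, -1))
    while grid.sum() < n_particles:
        # Launch each walker on a circle just outside the cluster.
        theta = rng.uniform(0.0, 2.0 * np.pi)
        x = int(round(c + (r_max + 5) * np.cos(theta)))
        y = int(round(c + (r_max + 5) * np.sin(theta)))
        while True:
            dx, dy = moves[rng.integers(4)]
            x, y = x + dx, y + dy
            r = np.hypot(x - c, y - c)
            if r > r_max + 20 or not (0 < x < size - 1 and 0 < y < size - 1):
                break                 # wandered too far; discard this walker
            if grid[x - 1:x + 2, y - 1:y + 2].any():
                grid[x, y] = True     # stick on contact with the cluster
                r_max = max(r_max, r)
                break
    return grid

cluster = dla()
print("particles aggregated:", int(cluster.sum()))
```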