20 results for “topic:self-consistency”
Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation 🍓 and hallucination alleviation 🍄.
Awesome LLM Self-Consistency: a curated list of self-consistency methods in Large Language Models
CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning
The official PyTorch implementation of Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence
Package for solving generalized BdG mean field theory of interacting systems.
KG-RAG + ToT + multi-agent LLMs for evidence-grounded QA with Neo4j and fine-tuning; reproducible medical case study & evaluation.
Perl implementation of a Markov chain for the course BIO331
Fixed-point solver for generic functions
GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.
Electronic-structure coursework for my master's degree
An evaluation of prompting techniques (Zero-Shot CoT, Few-Shot, Self-Consistency) on the Mistral-7B model for mathematical reasoning. This project systematically benchmarks 7 distinct methods on the GSM8K dataset.
Advanced prompt engineering techniques: Chain-of-Thought, Tree-of-Thoughts, ReAct, Self-Consistency
Self-consistent, model-based filter design for 3-phase PLLs.
Evaluation framework for self-hosted LLMs. Systematic prompt ablation (baseline, CoT, few-shot, self-consistency voting) on Llama 3.1 8B via lm-evaluation-harness, with Wilson CI statistical analysis, determinism validation, and load testing under concurrency. Found that chain-of-thought degrades accuracy by 25 percentage points at this small scale.
A consistency-based firewall for high-stakes Retrieval Augmented Generation (RAG). Queries the model multiple times and incinerates the output if entropy is high (divergent answers), preferring silence over hallucination.
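The mechanism this repository describes — sample the model several times, answer only when the samples agree, abstain when they diverge — can be sketched in a few lines. This is a minimal illustration, not the repository's actual code; the function name `entropy_gate` and the 1.0-bit threshold are hypothetical choices for the example.

```python
import math
from collections import Counter

def entropy_gate(answers, max_entropy_bits=1.0):
    """Majority-vote over sampled answers; return None (abstain) when the
    empirical answer distribution is too high-entropy, i.e. the samples
    diverge. `max_entropy_bits` is an illustrative threshold to tune.
    """
    counts = Counter(answers)
    n = len(answers)
    # Shannon entropy (in bits) of the empirical answer distribution.
    h = -sum((c / n) * math.log2(c / n) for c in counts.values())
    if h > max_entropy_bits:
        return None  # divergent answers: prefer silence over hallucination
    return counts.most_common(1)[0][0]

# Near-unanimous samples pass the gate; divergent samples are suppressed.
print(entropy_gate(["42", "42", "42", "42", "41"]))  # -> 42
print(entropy_gate(["42", "17", "9", "42", "3"]))    # -> None
```

In a real RAG pipeline, `answers` would be the parsed final answers from N independent samples of the same query; the threshold trades coverage against hallucination risk.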
10 stochastic parrots are better than 1 🦜
Developing an autonomous system for prompt selection for Large Language Models (LLMs), enhancing performance across tasks by balancing generality and specificity. This project automates diverse, high-quality prompt creation and selection, reducing manual intervention and maximizing LLM utility across applications.
Interactive Streamlit application that benchmarks direct prompting, chain-of-thought, self-consistency, tree-of-thought and reflexion techniques across OpenAI GPT-3.5 and Groq Gemma-9B-IT.
KERNEL v4.2 is an adaptive cognitive scheduler for LLMs integrating LATS, Reflexion, and Meta-Prompting. It dynamically optimizes reasoning routing according to task complexity (1-5) to maximize accuracy and reduce hallucinations. Powered by the Lichen-Collectives vision.
Demos and walkthroughs of published Machine Learning research papers