28 results for “topic:nshkr-crucible”
AI Firewall and guardrails for LLM-based Elixir applications
Interactive Phoenix LiveView demonstrations of the Crucible Framework - showcasing ensemble voting, request hedging, statistical analysis, and more, all backed by mock LLMs
Fairness and bias detection library for Elixir AI/ML systems
Intermediate Representation for the Crucible ML reliability ecosystem
Industrial ML training orchestration - backend-agnostic workflow engine for supervised, reinforcement, and preference learning. Provides composable workflows, declarative stage DSL, comprehensive telemetry, and port/adapter patterns for any ML backend. The missing orchestration layer that makes ML cookbooks trivially thin.
Data validation and quality library for ML pipelines in Elixir
ML model deployment for the Crucible ecosystem. vLLM and Ollama integration, canary deployments, A/B testing, traffic routing, health checks, rollback strategies, and inference serving for Elixir-based ML workflows.
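Of the features in this entry, the traffic-routing core is small enough to sketch: a canary split is at bottom a weighted random pick over deployments. A minimal illustration (the module name, `route/0`, and the 95/5 weights are assumptions, not this library's API):

```elixir
# Weighted traffic-split sketch for canary deployments (illustrative only).
defmodule CanaryRouteSketch do
  # Assumed split: 95% of traffic to stable, 5% to the canary.
  @weights [stable: 95, canary: 5]

  def route do
    total = @weights |> Keyword.values() |> Enum.sum()
    pick = :rand.uniform(total)

    # Walk the weight table until the random draw falls inside a bucket.
    {target, _weight} =
      Enum.reduce_while(@weights, pick, fn {name, weight}, remaining ->
        if remaining <= weight,
          do: {:halt, {name, weight}},
          else: {:cont, remaining - weight}
      end)

    target
  end
end
```

A real router would layer the health checks and rollback strategies mentioned above on top: when the canary's error rate crosses a threshold, its weight drops to zero and traffic shifts back to stable.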
Experimental research framework for running AI benchmarks at scale
Explainable AI (XAI) tools for the Crucible framework
Request hedging for tail latency reduction in distributed systems
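Request hedging is compact enough to illustrate inline. A minimal sketch of the technique (the module, `hedged_call/1`, and the 50 ms delay are assumptions for illustration, not this library's API): issue the request, and if no reply arrives within the hedge delay, launch an identical backup and keep whichever result lands first.

```elixir
# Request-hedging sketch (illustrative only).
defmodule HedgeSketch do
  # Assumed hedge delay for illustration.
  @hedge_delay_ms 50

  def hedged_call(fun) when is_function(fun, 0) do
    primary = Task.async(fun)

    case Task.yield(primary, @hedge_delay_ms) do
      {:ok, result} ->
        # Primary answered within the hedge delay; no backup needed.
        result

      nil ->
        # Primary is slow: race it against an identical backup request.
        backup = Task.async(fun)
        await_first([primary, backup])
    end
  end

  # Return the first task reply and shut the loser down.
  defp await_first(tasks) do
    receive do
      {ref, result} ->
        case Enum.split_with(tasks, &(&1.ref == ref)) do
          {[_winner], losers} ->
            Process.demonitor(ref, [:flush])
            Enum.each(losers, &Task.shutdown(&1, :brutal_kill))
            result

          {[], _tasks} ->
            # Unrelated 2-tuple message; a real implementation would use a
            # selective receive rather than consuming it.
            await_first(tasks)
        end
    end
  end
end
```

The cost of hedging is duplicated load: every slow request issues a second call, which is why the hedge delay is normally tuned to a high latency percentile (e.g. p95) rather than the median.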
Metrics aggregation and alerting for ML experiments—multi-backend export (Prometheus, InfluxDB, Datadog, OpenTelemetry), advanced aggregations (percentiles, histograms, moving averages), threshold-based alerting with anomaly detection (z-score, IQR), and time-series storage. Research-grade observability for the NSAI ecosystem.
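Among the detectors named in this entry, the z-score variant is the most compact: flag any value more than a threshold number of standard deviations from the sample mean. A sketch under assumed names (not this library's API):

```elixir
# Z-score outlier sketch (illustrative only).
defmodule ZScoreSketch do
  # Return the values lying more than `threshold` standard deviations
  # from the mean of `values`.
  def anomalies(values, threshold \\ 3.0) do
    n = length(values)
    mean = Enum.sum(values) / n

    sum_sq = Enum.reduce(values, 0.0, fn x, acc -> acc + (x - mean) * (x - mean) end)
    std = :math.sqrt(sum_sq / n)

    # With zero variance nothing can be an outlier.
    if std == 0.0 do
      []
    else
      Enum.filter(values, fn x -> abs(x - mean) / std > threshold end)
    end
  end
end

# ZScoreSketch.anomalies([10, 11, 9, 10, 10, 10, 11, 9, 10, 42], 2.0)
# #=> [42]
```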
Model evaluation harness for standardized benchmarking—comprehensive metrics (F1, BLEU, ROUGE, METEOR, BERTScore, pass@k), statistical analysis (confidence intervals, effect size, bootstrap CI, ANOVA), multi-model comparison, and report generation. Research-grade evaluation for LLM and ML experiments.
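Most of those metrics need full reference implementations, but pass@k has a closed form worth writing out: with n samples of which c pass, the unbiased estimator is pass@k = 1 - C(n-c, k)/C(n, k). A sketch in the numerically stable product form (names are assumptions, not this harness's API):

```elixir
# pass@k estimator sketch: 1 - C(n-c, k) / C(n, k), expanded into a
# product so no large binomial coefficients are materialized.
defmodule PassAtKSketch do
  # No sample passed, so no size-k draw can contain a pass.
  def pass_at_k(_n, 0, _k), do: 0.0

  # Fewer than k samples failed, so every size-k draw contains a pass.
  def pass_at_k(n, c, k) when n - c < k, do: 1.0

  def pass_at_k(n, c, k) do
    1.0 -
      Enum.reduce((n - c + 1)..n, 1.0, fn i, acc ->
        acc * (1.0 - k / i)
      end)
  end
end

# 10 samples, 3 passing: PassAtKSketch.pass_at_k(10, 3, 1)
# #=> ~0.3 (for k = 1 the estimator reduces to c/n)
```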
Phoenix LiveView dashboard for the Crucible ML reliability stack
ML model registry for the Crucible ecosystem. Artifact storage, model versioning, lineage tracking, metadata management, model comparison, reproducibility, and integration with training pipelines for Elixir-based ML workflows.
Statistical testing and analysis framework for AI research
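A representative primitive for this kind of framework is the percentile-bootstrap confidence interval (the evaluation harness above lists bootstrap CI as well): resample with replacement many times, recompute the statistic, and read the interval off the empirical quantiles. A sketch for the mean, with assumed names and defaults:

```elixir
# Percentile-bootstrap CI sketch for a sample mean (illustrative only;
# 1,000 resamples and a 95% interval assumed by default).
defmodule BootstrapSketch do
  def mean_ci(samples, reps \\ 1_000, alpha \\ 0.05) do
    n = length(samples)
    # Tuple for O(1) random access during resampling.
    indexed = List.to_tuple(samples)

    means =
      for _ <- 1..reps do
        # Draw n points with replacement and take their mean.
        total =
          Enum.reduce(1..n, 0.0, fn _, acc ->
            acc + elem(indexed, :rand.uniform(n) - 1)
          end)

        total / n
      end

    sorted = Enum.sort(means)
    lower = Enum.at(sorted, floor(reps * alpha / 2))
    upper = Enum.at(sorted, ceil(reps * (1 - alpha / 2)) - 1)
    {lower, upper}
  end
end
```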
Adversarial testing and robustness evaluation for the Crucible framework
CrucibleFramework: A scientific platform for LLM reliability research on the BEAM
ML training orchestration for the Crucible ecosystem. Distributed training, hyperparameter optimization, checkpointing, model versioning, metrics collection, early stopping, LR scheduling, gradient accumulation, and mixed precision training with Nx/Scholar integration.
Structured causal reasoning chain logging for LLM transparency
Multi-model ensemble voting strategies for LLM reliability
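The simplest such strategy, exact-match majority voting, fits in a few lines. A sketch (the module, `majority_vote/2`, and the 30-second timeout are assumptions, not this library's API): query every model concurrently, tally identical answers, and return the winner with its vote share.

```elixir
# Majority-vote sketch (illustrative only).
defmodule VoteSketch do
  # `model_funs` is a list of 1-arity functions, each wrapping one model.
  def majority_vote(model_funs, prompt) do
    answers =
      model_funs
      |> Task.async_stream(fn model -> model.(prompt) end, timeout: 30_000)
      # Crash/timeout handling elided: each element is {:ok, answer} on success.
      |> Enum.map(fn {:ok, answer} -> answer end)

    {winner, count} =
      answers
      |> Enum.frequencies()
      |> Enum.max_by(fn {_answer, count} -> count end)

    {winner, count / length(answers)}
  end
end
```

Note this votes on exact term equality; production ensembles usually normalize answers (casing, whitespace, numeric formats) before tallying, which is where most of the strategy design lives.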
HuggingFace Datasets for Elixir - A native Elixir port of the popular HuggingFace datasets library. Stream, load, and process ML datasets from the HuggingFace Hub with full BEAM/OTP integration. Supports Parquet streaming, dataset splitting, shuffling, and seamless integration with Nx tensors for machine learning workflows.
Dataset management and caching for AI research benchmarks
No description provided.
🚀 Accelerate ML training on the BEAM with CrucibleTrain's unified infrastructure for diverse model types and workflows.
Dataset management library for ML experiments—loaders for SciFact, FEVER, GSM8K, HumanEval, MMLU, TruthfulQA, HellaSwag; git-like versioning with lineage tracking; transformation pipelines; quality validation with schema checks and duplicate detection; GenStage streaming for large datasets. Built for reproducible AI research.
Advanced telemetry collection and analysis for AI research
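Telemetry tooling on the BEAM conventionally builds on the standard `:telemetry` library, so the collection side tends to look like the following regardless of the analysis layer on top (the event name and handler here are made up for illustration):

```elixir
# Emitting and consuming an event with the standard :telemetry library;
# the [:demo, :inference, :stop] event name is invented for this sketch.
:telemetry.attach(
  "demo-log-inference",
  [:demo, :inference, :stop],
  fn _event, %{duration: duration}, %{model: model}, _config ->
    IO.puts("#{model} inference took #{duration} native time units")
  end,
  nil
)

start = System.monotonic_time()
# ... run the inference being measured ...
:telemetry.execute(
  [:demo, :inference, :stop],
  %{duration: System.monotonic_time() - start},
  %{model: "mock-llm"}
)
```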
Training IR for reproducible ML jobs across Crucible and Kitchen. Defines model specs, adapters, learning config, checkpointing, validation, and resource envelopes to standardize training pipelines.
ML feedback loop management for the Crucible ecosystem. Quality monitoring, data drift detection, model performance tracking, data curation, active learning, human-in-the-loop workflows, and continuous improvement for Elixir-based ML.