DiegoPaniagua23/statistical-computing
Mathematical Foundations and Statistical Computing in R
Statistical Computing
This repository contains assignments and projects for the Statistical Computing course in the Master's program at CIMAT (Center for Research in Mathematics). The coursework emphasizes the practical application of statistical theory using computational methods, primarily with R programming.
๐ Course Content
This course covers key areas of statistical computing including:
- Generalized linear models
- Logistic and Poisson regression models
- Generalized linear models, the general case
- Linear models of variance and covariance analysis
- Log-linear models
- Time series analysis
- Stochastic processes
- Autocovariance and autocorrelation
- Stationary time series
- Autoregressive processes
- Moving average processes
- ARMA processes
- Spectral analysis of time series
- Computational intensive estimation methods
- EM algorithm
- MCMC algorithms
- Bootstrap
- Model evaluation and selection in regression analysis
- Criteria for evaluating and selecting the appropriate model
- Prediction error estimation: Cross-validation
- Model selection methods: Stepwise methods, AIC, BIC
- Variable selection methods: Regularization, Ridge,
LARS, LASSO
- Data imputation methods
- Methods based on regression and covariance analysis.
- Methods based on the EM algorithm
- Bayesian imputation
- Methods based on machine learning techniques
๐ Repository Structure
The repository follows the next structure:
statistical-computing/
โโโ homework/
โ โโโ 01_poisson_logistic_regression/
โ โ โโโ R/
โ โ โ โโโ data/
โ โ โ โโโ results/
โ โ โ โโโ scripts/
โ โ โโโ report/
โ โ โโโ README.md
โ โ โโโ setup_renv.R
โ โโโ 02_glms_categorical_and_count_data/
โ โ โโโ R/
โ โ โ โโโ data/
โ โ โ โโโ results/
โ โ โ โโโ scripts/
โ โ โโโ report/
โ โ โโโ README.md
โ โ โโโ setup_renv.R
โโโ LICENSE
โโโ README.md
๐ Assignments
The current course progress:
| Assignment | Topic | Key Methods | Link |
|---|---|---|---|
| 01 | Poisson & Logistic Regression | GLMs, MLE, McNemar's Test | ๐ View |
| 02 | GLMs: Categorical & Count Data | Logistic models, Poisson regression, diagnostics | ๐ View |
| 03 | ... | ... | ... |
| 04 | ... | ... | ... |
๐ Technical Stack
Programming & Analysis:
- R (โฅ4.3.0): Statistical computing and graphics
- Key R Packages:
dplyr,ggplot2,MASS
Documentation & Reporting:
- LaTeX: Professional mathematical typesetting
- Markdown: Repository documentation
- Git: Version control
Development Tools:
- RStudio: Integrated development environment
- renv: Reproducible package management
๐ Getting Started
Prerequisites
Ensure you have the following installed:
- R (version โฅ4.3.0)
- RStudio (recommended)
- LaTeX distribution (TeX Live, MiKTeX, or MacTeX)
- Git
๐ License
This project is licensed under the MIT License - see the LICENSE file for details.
This repository represents academic work in statistical computing, demonstrating the integration of mathematical theory with practical data analysis using modern computational tools.