AI Overview

Important

A very old overview of the starting point for my Machine Learning work. This was before I had a large graph network, and wanted to introduce structure to the notes. For more modern notes, they can be found more organically litered throughout the Artificial Intelligence directory

In addition, a lot of this was AI generated, which I have sort of moved on from nowadays as I find more use in writing the notes myself.

Neurons

Structure of a Artificial Neuron
- Inputs, Weights, Biases, Activation Functions
Activation Functions
- Role of activation functions
- Common types: Sigmoid, ReLU, Tanh, Softmax, etc.
Learning Process
- How neurons learn (weight adjustments)
- Gradient Descent and backpropagation (coming soon)

Architecture

Layers of a Neural Network
- Input layer, Hidden layers, Output layer
Types of Layers
- Fully connected (Dense) layers
- Convolutional layers (for CNNs)
- Recurrent layers (for RNNs)
Choosing the Number of Layers and Neurons
- How to decide the depth and width of your network

I have also layed out the workflow for how to design a model to fit a certain task/dataset in Network Modeling Workflow

Loss Functions

Loss Functions
- Purpose and types: MSE, Cross-Entropy, Hinge, Huber
Optimization Algorithms
- Gradient Descent (SGD, Mini-batch, etc.)
- Advanced optimizers: Adam, RMSprop, Adagrad
Regularization Techniques
- Preventing overfitting: L1/L2 regularization, Dropout, etc.

Training

Forward Propagation
- How inputs are processed through the network
Backward Propagation
- Calculating gradients and updating weights
Training Cycles
- Epochs, Batches, and Iterations
Evaluation Metrics
- Accuracy, Precision, Recall, F1 Score, etc.

Data Processing

Dataset Collection
- Finding and curating datasets
Data Preprocessing
- Normalization, Standardization, Handling missing data
Data Augmentation
- Techniques to artificially increase dataset size (especially in image processing)

Model Eval

Cross-Validation
- Ensuring generalization with techniques like k-fold cross-validation
Hyperparameter Tuning
- Tuning learning rate, batch size, number of epochs, etc.
- Grid Search, Random Search, Bayesian Optimization
Model Evaluation
- Testing on unseen data
- Analyzing confusion matrix and other metrics

Generative Modeling

Variational Autoencoder
- Changes the task of a neural network from predicting some exact output parameters to preciting a probability distribution, and then randomly sampling from said distribution.
Boltzmann Machine
Diffusion in Machine Learning

Graph View

AI Overview

Neurons

Architecture

Loss Functions

Training

Data Processing

Model Eval

Generative Modeling

Backlinks