Cross Entropy Loss
Cross-entropy is a widely used loss function, especially in classification tasks. It measures the difference between two probability distributions, typically the true label distribution and the distribution predicted by the model.
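To make this concrete, here is a minimal sketch of the cross-entropy computation for a single example. The function name, the three-class setup, and the specific probabilities are illustrative assumptions, not a fixed API.

```python
import numpy as np

def cross_entropy(p_true, q_pred, eps=1e-12):
    # Cross-entropy H(p, q) between the true distribution p and the predicted one q
    q_pred = np.clip(q_pred, eps, 1.0)   # avoid log(0)
    return -np.sum(p_true * np.log(q_pred))

# Illustrative 3-class example: the true class is the second one
p = np.array([0.0, 1.0, 0.0])            # one-hot true distribution
q = np.array([0.1, 0.7, 0.2])            # model's predicted probabilities
print(cross_entropy(p, q))               # ~0.357, i.e. -log(0.7)
```

The closer the predicted probability of the true class is to 1, the smaller the loss; a confident wrong prediction makes it blow up.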
Today, we'll build gradient descent for a more complex function. It's not as easy as it was for the 2D parabola; we need a more sophisticated method. Momentum is a powerful technique that will help us meet this challenge!
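As a preview, here is a minimal sketch of gradient descent with momentum on a toy 2D quadratic. The objective, starting point, learning rate, and momentum coefficient are all illustrative assumptions chosen just to show the update rule.

```python
import numpy as np

def f(x):
    # Toy objective: an elongated 2D paraboloid (illustrative only)
    return x[0]**2 + 10 * x[1]**2

def grad_f(x):
    return np.array([2 * x[0], 20 * x[1]])

x = np.array([5.0, 2.0])        # starting point
v = np.zeros_like(x)            # velocity accumulator
lr, beta = 0.05, 0.9            # step size and momentum coefficient (illustrative)

for step in range(100):
    v = beta * v - lr * grad_f(x)   # decaying accumulation of past gradients
    x = x + v                       # move along the accumulated velocity

print(x, f(x))                      # approaches the minimum at (0, 0)
```

The velocity term lets consistent gradient directions reinforce each other while oscillating ones partially cancel, which is exactly what we need on badly scaled functions.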
Using the centered difference approximation of the derivative can lead to numerical instability. Function optimization is a precise task; we cannot afford methods that introduce instability and unpredictability into our results. I've found a specific case that illustrates this issue.
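The sketch below shows the effect, assuming we approximate \( f'(x) \approx \frac{f(x+h) - f(x-h)}{2h} \) for \( f(x) = e^x \) at \( x = 1 \); the specific step sizes are illustrative.

```python
import numpy as np

def centered_diff(f, x, h):
    # Centered difference approximation of f'(x)
    return (f(x + h) - f(x - h)) / (2 * h)

f = np.exp                 # f'(x) = exp(x), so the exact derivative at 1.0 is e
exact = np.exp(1.0)

for h in [1e-1, 1e-4, 1e-8, 1e-12]:
    approx = centered_diff(f, 1.0, h)
    print(f"h = {h:<6g}  error = {abs(approx - exact):.2e}")

# The error first shrinks as h decreases, then grows again once floating-point
# round-off in f(x + h) - f(x - h) dominates the approximation.
```

This is the instability in miniature: making \( h \) smaller improves the mathematical approximation but eventually amplifies round-off error.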
The gradient, \( \nabla f(\textbf{x}) \), is the vector of partial derivatives of a function. Each component tells us how fast the function changes along one coordinate axis. If you want to minimize a function, you head in the negative gradient direction, because the gradient points towards the steepest ascent.
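Here is a minimal sketch of that idea, assuming the toy function \( f(x, y) = x^2 + 3y^2 \) and an illustrative step size of 0.1.

```python
import numpy as np

def f(x):
    # Simple two-variable function: f(x, y) = x^2 + 3*y^2
    return x[0]**2 + 3 * x[1]**2

def grad_f(x):
    # Vector of partial derivatives: (df/dx, df/dy)
    return np.array([2 * x[0], 6 * x[1]])

x = np.array([1.0, 2.0])
g = grad_f(x)               # [2.0, 12.0]: f increases fastest along this direction
x_new = x - 0.1 * g         # step against the gradient to decrease f
print(f(x), f(x_new))       # f drops from 13.0 to 2.56
```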
The gradient, \( \nabla f(\textbf{x}) \), tells us the direction in which a function increases the fastest. But why?