跳到主要内容

34 篇文档带有标签「calculus」

查看所有标签

Classification with a Neural Network

Neural networks are advanced computational models that mimic the human brain's structure, enabling them to capture and model complex, non-linear relationships between inputs and outputs. They consist of layers of perceptrons (neurons) that process inputs through weighted connections.

Definitions

Derivatives are a fundamental concept in calculus, representing the rate at which a function is changing at any given point. The formal definition and various notations are as follows:

Log Loss in Machine Learning

Machine learning often involves optimization problems that aim to minimize or maximize a particular function, known as a loss function. Two of the most common loss functions are square loss and log loss. In this note, we'll delve into log loss by exploring a probability-based example and provide the mathematical foundations for understanding it better.

Optimization using Gradient Descent - Least squares

Linear regression is a statistical method used to model the relationship between a dependent variable and one or more independent variables by fitting a linear equation to observed data. The simplest form of the linear equation with one dependent and one independent variable is represented as $y = mx + b$, where $m$ is the slope of the line and $b$ is the y-intercept. This method is widely used in predictive modeling and quantitative forecasting.

Optimization using Gradient Descent in one variable

Gradient Descent is an iterative method extensively utilized in finding the minimum or maximum of functions, particularly beneficial in multi-variable scenarios. This approach is foundational in optimization problems where exact solutions are challenging to derive analytically, especially due to complexities in higher dimensions.

Optimization using Gradient Descent in two variables

Gradient descent extends to functions of multiple variables by utilizing gradients instead of derivatives. The process involves iteratively moving in the direction opposite to the gradient of the function at the current point, to find a local minimum.

Regression with a perceptron

Neural networks are computational models that mimic the human brain's structure to process information. They consist of units called neurons or perceptrons, which are the fundamental building blocks of neural networks. The training of these networks involves adjusting weights and biases to minimize the error in predictions, a process achieved through algorithms like gradient descent and Newton's method.