AutoDiffSharp

This repo implements simple automatic differentiation (AD) in C# via operator overloading: dual numbers provide forward mode AD, and what seems to be a new representation for the dual of dual numbers, here called Codual numbers, provides reverse mode AD.

This probably isn't a super efficient implementation, but it's fine for small tests and for learning how AD works.

Dual Numbers

The most straightforward implementation of AD is based on dual numbers. Each regular number is augmented with an extra term corresponding to its derivative:

real number x   =(dual number)=>   x + x'*ϵ

Arithmetic and other mathematical functions then have translations to operating on these extended number types as follows:

Operator              Translated
<x, x'> + <y, y'>     <x + y, x' + y'>
<x, x'> - <y, y'>     <x - y, x' - y'>
<x, x'> * <y, y'>     <x*y, x'*y + x*y'>
<x, x'> / <y, y'>     <x/y, (x'*y - x*y')/y^2>
<x, x'>^k             <x^k, k*x^(k-1)*x'>
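
As a quick sanity check of the multiplication rule, differentiating f(x) = x*x at x = 3 means seeding the input's derivative with 1:

<3, 1> * <3, 1> = <3*3, 1*3 + 3*1> = <9, 6>

which matches f'(3) = 2*3 = 6.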

See the Wikipedia page on dual numbers for more transformations, like the standard trig functions.

Dual Numbers from Operator Overloading

Invoking a function "f" with dual numbers operates like this, in math notation:

f(x0 + ϵ0, x1 + ϵ1, x2 + ϵ2)

Each parameter carries an extra 'ϵ' value corresponding to its derivative, and this extra value is distinct from the 'ϵ' values of all other parameters. To carry the derivatives of all input parameters at once, we'd need an array of derivatives, where each index holds the derivative for the parameter at that index. That approach is pretty inefficient, but if you're interested in how it works, check out the history of this repo.
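
For reference, a hypothetical sketch of that array-based style might look like this (not the type this repo ships; the real version is in the repo's history). Note that every operation does O(N) work for N input parameters:

using System;
using System.Linq;

// Hypothetical sketch: each number carries one derivative slot per
// input parameter, so every operation touches the whole array.
public readonly struct DualN
{
    public readonly double Magnitude;
    public readonly double[] Derivatives;

    public DualN(double m, double[] d)
    {
        this.Magnitude = m;
        this.Derivatives = d;
    }

    public static DualN operator *(DualN lhs, DualN rhs) =>
        new DualN(lhs.Magnitude * rhs.Magnitude,
                  // product rule applied slot-by-slot: dl*y + dr*x
                  lhs.Derivatives.Zip(rhs.Derivatives,
                      (dl, dr) => dl * rhs.Magnitude + dr * lhs.Magnitude)
                      .ToArray());
}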

In order to make differentiation efficient, we'll first implement so-called forward mode automatic differentiation with dual numbers:

public readonly struct Dual
{
    public readonly double Magnitude;
    public readonly double Derivative;

    internal Dual(double m, double d)
    {
        this.Magnitude = m;
        this.Derivative = d;
    }
}

Since the dual number only carries around a single derivative, it must correspond to only one of the ϵx values, which means we can only take the derivative of one of the input parameters at a time. See the section below on reverse mode automatic differentiation for an efficient way to compute all derivatives simultaneously.

A differentiable function of 3 parameters would have this signature:

Dual Function(Dual x0, Dual x1, Dual x2)

Internally, differentiation invokes the function like this:

public static Dual DifferentiateX0At(
    double x0, double x1, double x2,
    Func<Dual, Dual, Dual, Dual> func) =>
    func(new Dual(x0, 1),
         new Dual(x1, 0),
         new Dual(x2, 0));

Each parameter's derivative is initialized to zero, except the parameter we're differentiating with respect to, whose derivative is seeded with 1.

Operators are now pretty straightforward, corresponding to operations on the magnitude and derivative:

public static Dual operator +(Dual lhs, Dual rhs) =>
    new Dual(lhs.Magnitude + rhs.Magnitude,
             lhs.Derivative + rhs.Derivative);

public static Dual operator *(Dual lhs, Dual rhs) =>
    new Dual(lhs.Magnitude * rhs.Magnitude,
             lhs.Derivative * rhs.Magnitude + rhs.Derivative * lhs.Magnitude);

Once you have your return value of type Dual, you can access the derivative:

var y_dx0 = Calculus.DifferentiateX0At(x0, x1, function);
Console.WriteLine("x0' = " + y_dx0.Derivative);

var y_dx1 = Calculus.DifferentiateX1At(x0, x1, function);
Console.WriteLine("x1' = " + y_dx1.Derivative);
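
Putting the pieces together, here is a minimal end-to-end sketch for a two-parameter function, using the Dual type and operators from above plus two-parameter DifferentiateX0At/DifferentiateX1At helpers (assumed here for illustration; the repo's actual helpers may differ in arity):

using System;

public readonly struct Dual
{
    public readonly double Magnitude;
    public readonly double Derivative;

    internal Dual(double m, double d)
    {
        this.Magnitude = m;
        this.Derivative = d;
    }

    public static Dual operator +(Dual lhs, Dual rhs) =>
        new Dual(lhs.Magnitude + rhs.Magnitude,
                 lhs.Derivative + rhs.Derivative);

    public static Dual operator *(Dual lhs, Dual rhs) =>
        new Dual(lhs.Magnitude * rhs.Magnitude,
                 lhs.Derivative * rhs.Magnitude + rhs.Derivative * lhs.Magnitude);
}

public static class Calculus
{
    // Seed the derivative of the parameter being differentiated with 1.
    public static Dual DifferentiateX0At(double x0, double x1, Func<Dual, Dual, Dual> f) =>
        f(new Dual(x0, 1), new Dual(x1, 0));

    public static Dual DifferentiateX1At(double x0, double x1, Func<Dual, Dual, Dual> f) =>
        f(new Dual(x0, 0), new Dual(x1, 1));
}

public static class Program
{
    public static void Main()
    {
        // f(x0, x1) = x0*x0 + x0*x1, so df/dx0 = 2*x0 + x1 and df/dx1 = x0
        Func<Dual, Dual, Dual> f = (x0, x1) => x0 * x0 + x0 * x1;
        var y_dx0 = Calculus.DifferentiateX0At(3, 4, f);
        var y_dx1 = Calculus.DifferentiateX1At(3, 4, f);
        Console.WriteLine("x0' = " + y_dx0.Derivative); // 2*3 + 4 = 10
        Console.WriteLine("x1' = " + y_dx1.Derivative); // 3
    }
}

Note that computing the full gradient this way requires one forward pass per input parameter, which is exactly the cost reverse mode avoids.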

Reverse Mode Automatic Differentiation

The above describes forward-mode AD, but taking the dual of the Dual type above yields a representation for so-called "reverse mode" AD. Instead of computing the derivative alongside the value, we construct a continuation that runs the derivative computation backwards, from outputs to inputs:

public readonly struct Codual
{
    public readonly double Magnitude;
    public readonly Action<double> Derivative;

    internal Codual(double m, Action<double> d)
    {
        this.Magnitude = m;
        this.Derivative = d;
    }
    
    public static Codual operator +(Codual lhs, Codual rhs) =>
        new Codual(lhs.Magnitude + rhs.Magnitude, dx =>
        {
            lhs.Derivative(dx);
            rhs.Derivative(dx);
        });
        
    public static Codual operator *(Codual lhs, Codual rhs) =>
        new Codual(lhs.Magnitude * rhs.Magnitude, dx =>
        {
            lhs.Derivative(dx * rhs.Magnitude);
            rhs.Derivative(dx * lhs.Magnitude);
        });
}

Where Dual is restricted to efficiently computing the derivative of only one parameter, Codual can efficiently compute the derivative of all parameters simultaneously:

Internally, DifferentiateAt runs the function once, seeds the output's continuation with 1 to kick off the backward pass, and collects the accumulated derivative of each input; the returned value then exposes each parameter's derivative by its index:

var y = Calculus.DifferentiateAt(x0, x1, function);
Console.WriteLine("x0' = " + y.Derivative(0));
Console.WriteLine("x1' = " + y.Derivative(1));
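
A minimal self-contained sketch of how such a helper could work (the DifferentiateAt shown here is hypothetical and returns a plain gradient array; the repo's actual API may differ): each input Codual's continuation accumulates into a gradient array, and the backward pass is started by seeding the output's continuation with 1:

using System;

public readonly struct Codual
{
    public readonly double Magnitude;
    public readonly Action<double> Derivative;

    internal Codual(double m, Action<double> d)
    {
        this.Magnitude = m;
        this.Derivative = d;
    }

    public static Codual operator +(Codual lhs, Codual rhs) =>
        new Codual(lhs.Magnitude + rhs.Magnitude, dx =>
        {
            lhs.Derivative(dx);
            rhs.Derivative(dx);
        });

    public static Codual operator *(Codual lhs, Codual rhs) =>
        new Codual(lhs.Magnitude * rhs.Magnitude, dx =>
        {
            lhs.Derivative(dx * rhs.Magnitude);
            rhs.Derivative(dx * lhs.Magnitude);
        });
}

public static class Program
{
    // Hypothetical helper: runs the function once, seeds the backward
    // pass with 1, and returns the accumulated gradient of all inputs.
    public static double[] DifferentiateAt(double x0, double x1, Func<Codual, Codual, Codual> f)
    {
        var grad = new double[2];
        var y = f(new Codual(x0, dx => grad[0] += dx),   // each input accumulates
                  new Codual(x1, dx => grad[1] += dx));  // into its gradient slot
        y.Derivative(1);  // seed dy/dy = 1 and run the chain backwards
        return grad;
    }

    public static void Main()
    {
        // f(x0, x1) = x0*x0 + x0*x1; gradient at (3, 4) is (10, 3)
        var grad = DifferentiateAt(3, 4, (x0, x1) => x0 * x0 + x0 * x1);
        Console.WriteLine("x0' = " + grad[0]);
        Console.WriteLine("x1' = " + grad[1]);
    }
}

The single call y.Derivative(1) visits every node of the computation once, which is why one backward pass yields every input's derivative.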

We can achieve the same for Dual by using a vector of derivatives instead of a single derivative, as I was doing in an older version of this repo, but this has considerably worse space and time complexity.

In general, forward-mode AD is best suited to functions of type R -> R^N, i.e. functions from a single real number to many real numbers, while reverse mode AD is best suited to functions of type R^N -> R. The latter are pretty common in machine learning these days.

Conversely, reverse mode AD is not well suited to functions that output many results, or to cases where you're only interested in the derivatives of a small subset of the inputs.
