Modeling a Nonlinear Friction Model using UDEs
Friction between moving bodies is not trivial to model. Idealised linear models exist, but they are often inadequate for complicated systems. Many nonlinear friction theories and models have been proposed, yet none are perfect. The aim of this tutorial is to use Universal Differential Equations (UDEs) to show how we can embed a neural network that learns an unknown nonlinear friction model.
Julia Environment
First, let's import the required packages.
using ModelingToolkitNeuralNets
using ModelingToolkit
import ModelingToolkit.t_nounits as t
import ModelingToolkit.D_nounits as Dt
using ModelingToolkitStandardLibrary.Blocks
using OrdinaryDiffEq
using Optimization
using OptimizationOptimisers: Adam
using SciMLStructures
using SciMLStructures: Tunable
using SymbolicIndexingInterface
using StableRNGs
using Lux
using Plots
Problem Setup
Let's use the friction model presented in https://www.mathworks.com/help/simscape/ref/translationalfriction.html for generating data.
Fbrk = 100.0
vbrk = 10.0
Fc = 80.0
vst = vbrk / 10
vcol = vbrk * sqrt(2)
function friction(v)
    sqrt(2 * MathConstants.e) * (Fbrk - Fc) * exp(-(v / vst)^2) * (v / vst) +
    Fc * tanh(v / vcol)
end
friction (generic function with 1 method)
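As a quick sanity check (not part of the original tutorial), we can verify a few qualitative properties of this curve: the force vanishes at rest, it is odd in the velocity, it exhibits the Stribeck breakaway peak at low velocity, and it saturates towards the Coulomb level Fc at high velocity. The definitions are repeated so the snippet is self-contained.

```julia
# Repeated here so the snippet is self-contained
Fbrk = 100.0
vbrk = 10.0
Fc = 80.0
vst = vbrk / 10
vcol = vbrk * sqrt(2)

function friction(v)
    sqrt(2 * MathConstants.e) * (Fbrk - Fc) * exp(-(v / vst)^2) * (v / vst) +
    Fc * tanh(v / vcol)
end

@assert friction(0.0) == 0.0                       # no force at rest
@assert friction(-3.0) ≈ -friction(3.0)            # odd in velocity
@assert friction(vst / sqrt(2)) > friction(2.0)    # Stribeck peak, then dip
@assert isapprox(friction(100.0), Fc; atol = 1e-3) # saturates towards Fc
```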
Next, we define the model: an object sliding on a 1D plane with a constant force Fu acting on it and a friction force opposing the motion.
function friction_true()
    @variables y(t) = 0.0
    @constants Fu = 120.0
    eqs = [
        Dt(y) ~ Fu - friction(y)
    ]
    return ODESystem(eqs, t, name = :friction_true)
end
friction_true (generic function with 1 method)
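Before handing the model to a stiff solver, a hand-rolled forward-Euler integration (a rough sketch, not the tutorial's ModelingToolkit + Rodas4 setup) illustrates the dynamics: since the applied force Fu = 120 exceeds the saturated friction level of about 80, the object never reaches a steady state and keeps accelerating. With a small step, the Euler result lands close to the reference value of roughly 9.19 at t = 0.1 seen below.

```julia
# Hand-rolled forward-Euler integration of Dt(y) ~ Fu - friction(y)
# (a rough sketch; the tutorial itself uses ModelingToolkit + Rodas4)
Fbrk, vbrk, Fc = 100.0, 10.0, 80.0
vst, vcol = vbrk / 10, vbrk * sqrt(2)
friction(v) = sqrt(2 * MathConstants.e) * (Fbrk - Fc) * exp(-(v / vst)^2) * (v / vst) +
              Fc * tanh(v / vcol)

function euler_velocity(Fu; dt = 1e-5, T = 0.1)
    y = 0.0
    for _ in 1:round(Int, T / dt)
        y += dt * (Fu - friction(y))  # unit mass: acceleration = net force
    end
    return y
end

yT = euler_velocity(120.0)  # close to the reference solution's ≈ 9.19 at t = 0.1
```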
Now that we have defined the model, we will simulate it from 0 to 0.1 seconds.
model_true = structural_simplify(friction_true())
prob_true = ODEProblem(model_true, [], (0, 0.1), [])
sol_ref = solve(prob_true, Rodas4(); saveat = 0.001)
retcode: Success
Interpolation: 1st order linear
t: 101-element Vector{Float64}:
0.0
0.001
0.002
0.003
0.004
0.005
0.006
0.007
0.008
0.009
⋮
0.092
0.093
0.094
0.095
0.096
0.097
0.098
0.099
0.1
u: 101-element Vector{Vector{Float64}}:
[0.0]
[0.11693525634386417]
[0.22815063944906144]
[0.3344560537929039]
[0.4368038331727264]
[0.5361810711751653]
[0.6335357754033413]
[0.7297484315722541]
[0.8255832780234691]
[0.9216604041698457]
⋮
[8.58644383781187]
[8.662919111645309]
[8.739090192388025]
[8.814960147942514]
[8.890532046211273]
[8.965808955096799]
[9.040793942501585]
[9.115490076328129]
[9.189900424478925]
Let's plot it.
scatter(sol_ref, label = "velocity")
That was the velocity. Let's also plot the friction force acting on the object throughout the simulation.
scatter(sol_ref.t, friction.(first.(sol_ref.u)), label = "friction force")
Model Setup
Now, we will try to learn the same friction model using a neural network. We will use NeuralNetworkBlock to define the neural network as a component. The input of the neural network is the velocity and the output is the friction force. We connect the neural network to the model using RealInputArray and RealOutputArray blocks.
function friction_ude(Fu)
    @variables y(t) = 0.0
    @constants Fu = Fu
    @named nn_in = RealInputArray(nin = 1)
    @named nn_out = RealOutputArray(nout = 1)
    eqs = [Dt(y) ~ Fu - nn_in.u[1]
           y ~ nn_out.u[1]]
    return ODESystem(eqs, t, name = :friction, systems = [nn_in, nn_out])
end
Fu = 120.0
model = friction_ude(Fu)
chain = Lux.Chain(
    Lux.Dense(1 => 10, Lux.mish, use_bias = false),
    Lux.Dense(10 => 10, Lux.mish, use_bias = false),
    Lux.Dense(10 => 1, use_bias = false)
)
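It can be useful to see what this chain looks like numerically before wiring it into the system. The sketch below (an illustrative check using the standard `Lux.setup` and `Lux.parameterlength` API) initializes the parameters with the same StableRNG seed and evaluates the untrained network on a sample velocity. Note the chain has 1·10 + 10·10 + 10·1 = 120 tunable weights, which is why the optimization result later is a 120-element vector.

```julia
using Lux, StableRNGs

chain = Lux.Chain(
    Lux.Dense(1 => 10, Lux.mish, use_bias = false),
    Lux.Dense(10 => 10, Lux.mish, use_bias = false),
    Lux.Dense(10 => 1, use_bias = false)
)

# Initialize parameters and states with the same seed as NeuralNetworkBlock
ps, st = Lux.setup(StableRNG(1111), chain)

# The network maps a 1-element velocity input to a 1-element force output
out, _ = chain([0.5], ps, st)

Lux.parameterlength(chain)  # 10 + 100 + 10 = 120 tunable weights
```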
@named nn = NeuralNetworkBlock(1, 1; chain = chain, rng = StableRNG(1111))
eqs = [connect(model.nn_in, nn.output)
       connect(model.nn_out, nn.input)]
ude_sys = complete(ODESystem(eqs, t, systems = [model, nn], name = :ude_sys))
sys = structural_simplify(ude_sys)
\[ \begin{align} \frac{\mathrm{d} \mathtt{friction.y}\left( t \right)}{\mathrm{d}t} &= \mathtt{friction.Fu} - \mathtt{friction.nn\_in.u}\left( t \right)_{1} \\ 0 &= - \mathtt{friction.nn\_out.u}\left( t \right)_{1} + \mathtt{nn.input.u}\left( t \right)_{1} \end{align} \]
Optimization Setup
We now set up the loss function and the optimization loop.
function loss(x, (prob, sol_ref, get_vars, get_refs, set_x))
    new_p = set_x(prob, x)
    new_prob = remake(prob, p = new_p, u0 = eltype(x).(prob.u0))
    ts = sol_ref.t
    new_sol = solve(new_prob, Rodas4(), saveat = ts, abstol = 1e-8, reltol = 1e-8)
    loss = zero(eltype(x))
    for i in eachindex(new_sol.u)
        loss += sum(abs2.(get_vars(new_sol, i) .- get_refs(sol_ref, i)))
    end
    if SciMLBase.successful_retcode(new_sol)
        loss
    else
        Inf
    end
end
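The core of this loss is a sum of squared differences between the UDE solution and the reference data at each save point (with Inf returned when the solve fails, so unstable parameter sets are rejected). A toy stand-in with hypothetical state arrays makes the accumulation explicit:

```julia
# Toy stand-in for the loss accumulation: sum of squared differences
# between hypothetical "solver" states and "reference" states per timestep
pred = [[0.1], [0.3], [0.6]]
ref  = [[0.0], [0.3], [0.5]]

sse(pred, ref) = sum(sum(abs2.(p .- r)) for (p, r) in zip(pred, ref))
sse(pred, ref)  # ≈ 0.02
```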
of = OptimizationFunction{true}(loss, AutoForwardDiff())
prob = ODEProblem(sys, [], (0, 0.1), [])
get_vars = getu(sys, [sys.friction.y])
get_refs = getu(model_true, [model_true.y])
set_x = setp_oop(sys, sys.nn.p)
x0 = default_values(sys)[sys.nn.p]
cb = (opt_state, loss) -> begin
    @info "step $(opt_state.iter), loss: $loss"
    return false
end
op = OptimizationProblem(of, x0, (prob, sol_ref, get_vars, get_refs, set_x))
res = solve(op, Adam(5e-3); maxiters = 10000, callback = cb)
retcode: Default
u: 120-element Vector{Float64}:
-2.363635617199658
-2.4192743321986514
2.055321602454787
-0.2999252069237061
0.9657348368044494
-2.4086786500487363
-2.321253698035093
0.5512956242140039
-2.4360068303020124
-2.425651887694722
⋮
-2.1175791468725773
1.231137921644243
-1.046475488871763
-0.25115604088590265
-1.5963390505289439
1.4534888331289002
-2.282535867419842
-1.4113655162853294
-1.5278017288067132
Visualization of results
We now have a trained neural network! We can check whether simulating the model with the embedded neural network matches the data.
res_p = set_x(prob, res.u)
res_prob = remake(prob, p = res_p)
res_sol = solve(res_prob, Rodas4(), saveat = sol_ref.t)
Also, it is interesting to run the simulation before training, to see the network's starting point.
initial_sol = solve(prob, Rodas4(), saveat = sol_ref.t)
retcode: Success
Interpolation: 1st order linear
t: 101-element Vector{Float64}:
0.0
0.001
0.002
0.003
0.004
0.005
0.006
0.007
0.008
0.009
⋮
0.092
0.093
0.094
0.095
0.096
0.097
0.098
0.099
0.1
u: 101-element Vector{Vector{Float64}}:
[0.0, 0.0]
[0.12001891802081038, 0.12001891802081037]
[0.24007647700875054, 0.24007647700875048]
[0.3601739747958169, 0.3601739747958169]
[0.4803131095528584, 0.48031310955285844]
[0.6004927757829802, 0.6004927757829802]
[0.7207126193353081, 0.7207126193353081]
[0.8409730642995663, 0.8409730642995663]
[0.9612712084090433, 0.9612712084090433]
[1.0816044728123086, 1.0816044728123086]
⋮
[11.145787875099332, 11.145787875099332]
[11.267953036754593, 11.267953036754593]
[11.390141551847606, 11.390141551847606]
[11.512353442656476, 11.512353442656476]
[11.634588731459306, 11.634588731459306]
[11.756847440534198, 11.756847440534198]
[11.879129592159257, 11.879129592159257]
[12.001435208612586, 12.001435208612586]
[12.123764312172286, 12.123764312172286]
Now we plot it.
scatter(sol_ref, idxs = [model_true.y], label = "ground truth velocity")
plot!(res_sol, idxs = [sys.friction.y], label = "velocity after training")
plot!(initial_sol, idxs = [sys.friction.y], label = "velocity before training")
It matches the data well! Let's also check the predictions for the friction force, to see whether the network actually learnt the friction model.
scatter(sol_ref.t, friction.(first.(sol_ref.u)), label = "ground truth friction")
plot!(res_sol.t, getindex.(res_sol[sys.nn.output.u], 1),
label = "friction from neural network")
It learns the friction model well!