Derivative of MSE function

Dhyey_Kotecha · May 29, 2020, 4:45am

Hello!

In the explanation related to the Machine Learning models, there is a formula for computing Mean Squared Error (MSE). I have replicated it below as snapshots.

The following properties of derivatives are found relevant for solving this:

Power rule: For h(x) = x^n; h’(x) = nx^(n-1)
Chain rule: For h(x) = f(g(x)); h’(x) = f’g(x) * g’(x)

However, I am struggling to figure out the solution, especially the last x(i)j in the second equation. Any help with step-by-step solution to this problem is appreciated.

MSE:
Derivative of MSE:

Moreover, I would also like to understand what the x(i)j represents; as θj represents the value of θ for each of the feature (i.e. θ1, θ2,…θn) and x(i) defines the feature vector for the ith instance.

Thanks!

Tag:@sandeepgiri

Ankur_Sinha · May 28, 2020, 2:00pm

Hi Dhyey,

The derivative here is the Partial Derivative of the equation. Partial derivative means differentiating the equation with respect to just one variable, in our case, theta j, and keeping the other variables constant, and the two rules of multi-variate calculus i.e. power rule and the chain rule is used.

I hope it helps.
Regards.

Dhyey_Kotecha · May 29, 2020, 3:04am

Thank you for the help, @Ankur_Sinha.

Would you mind giving step-wise solution for calculating the partial derivative of the MSE function given below, please?

Ankur_Sinha · May 29, 2020, 4:05am

Hi Dhyey,

Here is the way I understood it. I hope it helps.

Regards.

Dhyey_Kotecha · May 29, 2020, 4:11am

Hi! Ankur,

It was indeed helpful to me.

Many thanks for your time and effort.

Ankur_Sinha · May 29, 2020, 4:16am

Hi Dhyey,

No Problem at all.

Happy Learning !!
Regards.