Hello!
In the explanation related to the Machine Learning models, there is a formula for computing Mean Squared Error (MSE). I have replicated it below as snapshots.
The following properties of derivatives are found relevant for solving this:
- Power rule: For h(x) = x^n; h’(x) = nx^(n-1)
- Chain rule: For h(x) = f(g(x)); h’(x) = f’g(x) * g’(x)
However, I am struggling to figure out the solution, especially the last x(i)j in the second equation. Any help with step-by-step solution to this problem is appreciated.
MSE:
Derivative of MSE:
Moreover, I would also like to understand what the x(i)j represents; as θj represents the value of θ for each of the feature (i.e. θ1, θ2,…θn) and x(i) defines the feature vector for the ith instance.
Thanks!
Tag:@sandeepgiri