Well, I hope I'm not late for the party! Let me first try to establish some intuition before digging into loads of information (warning: this is not a short comparison).
h(x) .
, , , (.. , ..).

- (aka Thetas Weights), . , , , - , !
, (.. ) (. . , , ).
, , (.. ) , , , : 
(.. ), = 0 ( , ).
, , , , Local Optima; :

This assumes familiarity with terms such as: Derivative, Tangent Line, Cost Function, Hypothesis, etc.
: (. ).
:
Given a function f(x), we can find its tangent at x=a. The equation of the tangent line L(x) is: L(x) = f(a) + f′(a)(x − a).
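To make the tangent-line formula L(x) = f(a) + f′(a)(x − a) concrete, here is a minimal Python sketch. The helper name `linear_approximation` and the choice of f(x) = sqrt(x) around a = 4 are illustrative assumptions, not part of the original answer.

```python
import math

def linear_approximation(f, df, a):
    """Return L(x) = f(a) + f'(a) * (x - a), the tangent line of f at a."""
    fa, slope = f(a), df(a)
    return lambda x: fa + slope * (x - a)

# Illustrative (assumed) example: f(x) = sqrt(x), expanded around a = 4.
L = linear_approximation(math.sqrt, lambda x: 0.5 / math.sqrt(x), 4.0)

near = abs(L(4.1) - math.sqrt(4.1))  # error close to a
far = abs(L(9.0) - math.sqrt(9.0))   # error far from a
```

The error is tiny close to a and grows as x moves away, which is why the tangent line is only trusted near x=a.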
:

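Near x=a, the tangent line and the function have nearly the same graph. The tangent line L(x) can therefore be used as an approximation to f(x) near x=a. In that case it is called the linear approximation to the function at x=a.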
:
, , , , 0, .
( , ), :

The quadratic approximation of f at x=a adds the second-derivative term to the linear one: Qa(x) = f(a) + f′(a)(x − a) + f″(a)(x − a)²/2
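One way the quadratic approximation Qa(x) = f(a) + f′(a)(x − a) + f″(a)(x − a)²/2 is used in optimization: when f″(a) > 0, setting the derivative of Qa to zero gives its minimizer x = a − f′(a)/f″(a), and repeating that step gives a one-dimensional Newton iteration. The sketch below is a minimal illustration; the function name `newton_minimize` and the test function f(x) = exp(x) − 2x are assumptions chosen for the example.

```python
import math

def newton_minimize(df, d2f, x0, steps=20, tol=1e-12):
    """Repeatedly jump to the minimizer of the local quadratic approximation."""
    x = x0
    for _ in range(steps):
        step = df(x) / d2f(x)  # a - f'(a)/f''(a) minimizes Q_a when f''(a) > 0
        x -= step
        if abs(step) < tol:
            break
    return x

# Assumed example: f(x) = exp(x) - 2x, whose minimum is at x = ln(2).
x_min = newton_minimize(lambda x: math.exp(x) - 2.0, math.exp, x0=0.0)
```

In n dimensions the second derivative becomes the n x n Hessian matrix, so each step needs a linear solve rather than a division.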
.
1. Newton's Method
x: (.. ).
. , (.. ).
( - nxn).
, , f(x) xn / ( ). , f(x) , .
:
- (.. ).
, (.. , , !).
2. Limited-memory Broyden-Fletcher-Goldfarb-Shanno Algorithm (L-BFGS):
, , , ( ). , .
" " , , .
, , L-BFGS , , "" , , , ,
3. A Library for Large Linear Classification (LIBLINEAR):
, ( , , ).
(CD), , .
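LIBLINEAR is the winner of the ICML 2008 large-scale learning challenge. It supports L1 regularization, and it is recommended for high-dimensional datasets (large-scale classification problems).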
:
(.. ), .
.
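It cannot learn a true multinomial (multiclass) model; instead, the optimization problem is decomposed in a "one-vs-rest" fashion, so a separate binary classifier is trained for each class.
Side note: according to the Scikit documentation, "liblinear" was the default solver, for historical reasons, before version 0.22. Since then the default is "lbfgs".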
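4. Stochastic Average Gradient (SAG):
The SAG method optimizes the sum of a finite number of smooth convex functions. Like stochastic gradient (SG) methods, the SAG method's iteration cost is independent of the number of terms in the sum. However, by keeping a memory of previous gradient values, the SAG method achieves a faster convergence rate than black-box SG methods.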
, , .
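Drawbacks:
It only supports L2 penalization.
Its memory cost is O(N), which can make it impractical for large N (because it remembers the most recently computed gradient for nearly every sample).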
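The O(N) memory cost comes from the table of stored per-sample gradients. Below is a minimal NumPy sketch of the SAG idea for L2-regularized logistic regression. It is an illustration under stated assumptions, not scikit-learn's implementation: the function name `sag_logistic`, the synthetic data, the step size 1/L, and the value of `lam` are all choices made for this example.

```python
import numpy as np

def sag_logistic(X, y, lam=0.1, epochs=100, seed=0):
    """Minimal SAG sketch for L2-regularized logistic regression, y in {0, 1}."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    grad_table = np.zeros((n, d))  # one stored gradient per sample: the O(N) memory
    grad_sum = np.zeros(d)         # running sum of the stored gradients
    # Assumed step size 1/L, with L bounding the smoothness of each term.
    step = 1.0 / (0.25 * np.max(np.sum(X**2, axis=1)) + lam)
    for _ in range(epochs * n):
        i = rng.integers(n)
        p = 1.0 / (1.0 + np.exp(-(X[i] @ w)))
        g = (p - y[i]) * X[i] + lam * w  # fresh gradient of term i only
        grad_sum += g - grad_table[i]    # replace the old stored gradient
        grad_table[i] = g
        w -= step * grad_sum / n         # move along the average stored gradient
    return w

# Synthetic data, assumed purely for illustration.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 3))
y = (X @ np.array([1.5, -2.0, 0.5]) + 0.3 * rng.normal(size=200) > 0).astype(float)

w = sag_logistic(X, y)
full_grad = X.T @ (1.0 / (1.0 + np.exp(-(X @ w))) - y) / len(y) + 0.1 * w
```

Each iteration touches a single sample, like SG, but the update direction averages the remembered gradients of all samples. For linear models the table can be reduced to one scalar per sample, which this sketch skips for clarity.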
5. SAGA:
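The SAGA solver is a variant of SAG that also supports the non-smooth penalty = l1 option (i.e. L1 regularization). It is therefore the solver of choice for sparse multinomial logistic regression, and it also has better theoretical convergence than SAG.
Side note: according to the Scikit documentation, the SAGA solver is often the best choice.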
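To try the five solvers side by side, here is a small sketch using scikit-learn's `LogisticRegression` and its `solver` parameter. The synthetic binary dataset, the scaling step, and `max_iter=1000` are assumptions made for the example; results on real data will differ.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

# Assumed toy data: a binary problem with one cluster per class.
X, y = make_classification(
    n_samples=500, n_features=10, n_clusters_per_class=1, random_state=0
)
X = StandardScaler().fit_transform(X)  # sag and saga converge faster on scaled features

scores = {}
for solver in ["newton-cg", "lbfgs", "liblinear", "sag", "saga"]:
    clf = LogisticRegression(solver=solver, max_iter=1000)
    scores[solver] = clf.fit(X, y).score(X, y)  # training accuracy

for solver, acc in scores.items():
    print(f"{solver:10s} {acc:.3f}")
```

On a small, well-scaled problem like this one the solvers should land on very similar accuracies; the differences described above show up mainly in speed, memory use, and which penalties each solver accepts.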
Scikit
