comparison with SGDClassifier #1

Open
amueller opened this issue Jul 30, 2013 · 13 comments

@amueller commented Jul 30, 2013
Hey. Did you compare with SGDClassifier?
The results should be quite close to yours.

@ejlb commented Jul 30, 2013

I will compare. The original paper did some comparisons with SGD (not sklearn's implementation) and found that the projection step and adaptive learning rate improved performance.
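For reference, a minimal sketch of the update in question, as I read the paper (adaptive rate eta_t = 1/(lambda*t), hinge subgradient, then the optional projection onto the ball of radius 1/sqrt(lambda)); dense numpy arrays and labels in {-1, +1} are assumed:

import numpy as np

def pegasos_step(w, x, y, lam, t):
    """One Pegasos update (sketch): adaptive rate, hinge subgradient,
    then the optional projection onto the ball of radius 1/sqrt(lam)."""
    eta = 1.0 / (lam * t)              # adaptive learning rate eta_t = 1/(lambda*t)
    violated = y * np.dot(w, x) < 1.0  # hinge loss active at the current w?
    w = (1.0 - eta * lam) * w          # regularization shrinkage
    if violated:
        w = w + eta * y * x            # subgradient correction
    norm = np.linalg.norm(w)           # optional projection step
    if norm > 1.0 / np.sqrt(lam):
        w = w / (norm * np.sqrt(lam))
    return w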

ghost assigned ejlb Jul 30, 2013
@amueller
The SGD in scikit-learn actually has an adaptive learning rate - it can even be set to be the same as pegasos, I believe.
For the projection step, the claims are much milder in the journal version of the paper and in the source code they provide it is commented out.
I have not seen a careful analysis of the projection step, though, and would be quite interested in that.

@amueller
After looking it up again, I think you need to set power_t=1 to get the pegasos schedule.
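For concreteness (illustrative snippet, not from the thread): with learning_rate='invscaling' sklearn computes eta = eta0 / t**power_t, so power_t=1 gives eta0 / t, which coincides with the pegasos rate 1/(lambda*t) when eta0 = 1/alpha:

# sklearn's 'invscaling' schedule vs. the pegasos schedule (illustrative)
eta_sklearn = lambda t, eta0, power_t: eta0 / t ** power_t  # power_t=1 -> eta0 / t
eta_pegasos = lambda t, lam: 1.0 / (lam * t)                # equal when eta0 == 1/lam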

@ejlb commented Aug 6, 2013

Here are some benchmarks with identical learning rates:

https://raw.github.com/ejlb/pegasos/master/benchmarks/benchmarks.png

Pegasos seems to be slightly more accurate (~1%). The only two differences I know of are:

  1. the pegasos projection step
  2. pegasos trains on randomly sampled examples, so it may get a better generalisation error.

Due to point 2 it is hard to compare speed across iterations.

@amueller commented Aug 6, 2013

Wow, that looks quite good. I'm quite surprised your implementation is significantly faster than sklearn's.
Do you have any idea where that could come from?
Also, could you please share your benchmark script?

cc @pprett @larsmans

@amueller commented Aug 6, 2013

You say that training on random samples makes it hard to compare speeds. How so? One iteration of SGD is n_samples many updates, which you should compare against n_samples many updates in pegasos. Or did you compare against single updates here?
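Put differently, equal work means equal numbers of weight updates (a sketch; the variable names are mine):

n_samples = X.shape[0]
n_epochs = 5                          # SGDClassifier's n_iter (full passes)
sgd_updates = n_epochs * n_samples    # one update per sample per pass
pegasos_iters = sgd_updates           # one pegasos update per iteration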

@ejlb commented Aug 6, 2013

@amueller SGDClassifier trains on the whole data set at each iteration, I assume? That is probably where the speed increase comes from.

edit: yes, true, that would be a good comparison. Will upload the benchmark script.

@amueller commented Aug 6, 2013

Ok, but then the plot doesn't make sense. You should rescale it such that the number of weight updates is the same.

@ejlb commented Aug 6, 2013

Yeah, will run some benchmarks with equal weight updates.

@larsmans commented Aug 6, 2013

Yes, SGDClassifier does

for i in xrange(n_iter):
    shuffle(X, y)             # new random order every epoch
    for x, y_i in zip(X, y):
        update(w, x, y_i)     # one weight update per sample

It also wastes a little bit of time in each update, checking whether it should do a passive-aggressive (PA) update or a vanilla additive one.

@ejlb commented Aug 6, 2013

This makes much more sense:

https://raw.github.com/ejlb/pegasos/master/benchmarks/weight_updates/benchmarks.png

Perhaps batching the pegasos weight updates would retain the slight accuracy boost and improve the training time.
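Something like the mini-batch variant in the paper, which averages the hinge subgradient over a random batch of k examples per iteration; a rough numpy sketch (function name and defaults are mine), assuming dense X and labels in {-1, +1}:

import numpy as np

def minibatch_pegasos(X, y, lam=1e-4, n_iter=1000, k=32, seed=0):
    """Mini-batch Pegasos (sketch): average the hinge subgradient over
    k randomly drawn examples per iteration, then project."""
    rng = np.random.RandomState(seed)
    n_samples, n_features = X.shape
    w = np.zeros(n_features)
    for t in range(1, n_iter + 1):
        idx = rng.randint(0, n_samples, size=k)  # random batch (with replacement)
        Xb, yb = X[idx], y[idx]
        viol = yb * Xb.dot(w) < 1.0              # margin violators in the batch
        eta = 1.0 / (lam * t)
        w = (1.0 - eta * lam) * w + (eta / k) * Xb[viol].T.dot(yb[viol])
        norm = np.linalg.norm(w)                 # optional projection step
        if norm > 1.0 / np.sqrt(lam):
            w /= norm * np.sqrt(lam)
    return w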

@amueller commented Aug 6, 2013

Yeah, that looks more realistic ;)
How did you set alpha and did you set eta0 in the SGD?

@ejlb commented Aug 7, 2013

I used this: SGDClassifier(power_t=1, learning_rate='invscaling', n_iter=sample_coef, eta0=0.01). The full benchmark is here: https://github.com/ejlb/pegasos/blob/master/benchmarks/weight_updates/benchmark.py
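One caveat worth double-checking (just a thought): the invscaling and pegasos schedules only coincide when eta0 = 1/alpha, so a fixed eta0=0.01 matches the pegasos rate only if alpha is 100. Something like this would tie them together (sketch; alpha here is whatever regularization strength both sides share):

# hypothetical: tie eta0 to alpha so the rate is exactly 1/(alpha*t)
SGDClassifier(power_t=1, learning_rate='invscaling',
              n_iter=sample_coef, alpha=alpha, eta0=1.0 / alpha)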
