Use case:
I have a small dataset with about 3-10 samples in each class. I am using sklearn SVC to classify those with RBF kernel. I need the confidence of the prediction along with the predicted class. I used the predict_proba method of SVC. I was getting weird results with that. I searched a bit and found out that it makes sense only for larger datasets.
Found this question on stack Scikit-learn predict_proba gives wrong answers.
The author of the question verified this by multiplying the dataset, thereby duplicating the dataset.
My questions:
1) If I multiply my dataset by let's say 100, having each sample 100 times, it increases the "correctness" of "predict_proba". What side effects will it have? Overfitting?
2) Is there any other way I can calculate the confidence of the classifier? Like distance from the hyperplanes?
3) For this small sample size, is SVM a recommended algorithm or should I choose something else?