Up sampling Yeas. This will duplicate Yeas until Training is 50% Yeas.
All Entries: 1840 Affirmitive examples: 367 (20%)
Training: 2346 Affirmitive examples: 1173.0 (50.0%)
Testing: 368 Affirmitive examples: 68.0 (18.0%)
0s: 0.82 True Positives: 63 Accuracy: 0.9619565217391305
1s: 0.18 True Negatives: 291 AUC: 0.9799019607843138
False Positives: 9 F1 Score: 0.8999999999999999
False Negatives: 5 Recall (fract of actual yeas found): 0.9264705882352942
Precision (correctness of yeas predicted): 0.875
Trianed on 2019-04-23_best-guess_binary.csv. Embedding = "word2vec".
Model = "VotingClassifier(estimators=[('logistic', LogisticRegression(C=1000000000.0, class_weight='balanced', dual=False,
fit_intercept=False, intercept_scaling=1, max_iter=100,
multi_class='warn', n_jobs=None, penalty='l2', random_state=None,
solver='warn', tol=0.0001, verbose=0, warm_start=Fa... verbose=0, warm_start=False)), ('GaussianNB', GaussianNB(priors=None, var_smoothing=1e-09))],
flatten_transform=None, n_jobs=None, voting='soft',
weights=[1, 1, 1, 1, 1])".