Tackling large data sets
2017-06-23
awakeLives
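a. Large data sets with high-cardinality features can be handled well with linear online models, which learn from one example at a time and so never need the full data set in memory. Vowpal Wabbit, libFM, libFFM and LIBLINEAR are good tools to try, and FTRL is a good algorithm.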
b. Consider sparse matrices in Python (for example CSR matrices from scipy.sparse), which store only the non-zero entries.
c. Consider ensembling, that is, combining the predictions of models trained on smaller parts of the data.
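
The three tips above can be sketched together in a few lines of plain Python. This is only an illustration under assumptions, not how any of the tools listed actually work internally: the toy click data, the bucket count `N_BUCKETS`, the learning rate and all function names are made up for the example. In real work you would more likely reach for `scipy.sparse.csr_matrix` and a tool such as Vowpal Wabbit. The sketch hashes high-cardinality tokens into columns, stores rows in CSR form (data, indices, indptr), trains a logistic regression with plain SGD one example at a time, and averages the models trained on separate chunks.

```python
import math
import random
import zlib

N_BUCKETS = 2 ** 16  # assumed size of the hashed feature space


def hash_feature(token):
    """Hashing trick: map a categorical token to a column index."""
    return zlib.crc32(token.encode("utf-8")) % N_BUCKETS


def to_csr(rows):
    """Pack rows of tokens into CSR arrays (data, indices, indptr)."""
    data, indices, indptr = [], [], [0]
    for tokens in rows:
        counts = {}
        for token in tokens:
            col = hash_feature(token)
            counts[col] = counts.get(col, 0.0) + 1.0
        for col in sorted(counts):
            indices.append(col)
            data.append(counts[col])
        indptr.append(len(indices))  # row i spans indptr[i]:indptr[i + 1]
    return data, indices, indptr


def csr_row(csr, i):
    """Return the (column, value) pairs of row i, non-zeros only."""
    data, indices, indptr = csr
    return [(indices[k], data[k]) for k in range(indptr[i], indptr[i + 1])]


def predict(weights, row):
    """Logistic prediction for one sparse row."""
    z = sum(weights[col] * val for col, val in row)
    z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-z))


def train_online(csr, labels, epochs=5, lr=0.1):
    """Logistic regression with plain SGD, updated one example at a time."""
    weights = [0.0] * N_BUCKETS
    for _ in range(epochs):
        for i, label in enumerate(labels):
            row = csr_row(csr, i)
            err = predict(weights, row) - label
            for col, val in row:  # only the non-zero columns are touched
                weights[col] -= lr * err * val
    return weights


def ensemble_predict(models, csr, i):
    """Average the probabilities of models trained on different chunks."""
    row = csr_row(csr, i)
    return sum(predict(w, row) for w in models) / len(models)


def make_toy_data(n, rng):
    """Hypothetical click data: the label depends only on the ad id."""
    rows, labels = [], []
    for _ in range(n):
        ad = rng.randrange(10)
        rows.append(["user=%d" % rng.randrange(50), "ad=%d" % ad])
        labels.append(1 if ad < 3 else 0)
    return rows, labels


rng = random.Random(0)
train_rows, train_labels = make_toy_data(2000, rng)
test_rows, test_labels = make_toy_data(500, rng)

# Train one model per chunk, so no model ever sees the full training set.
chunk = 500
models = []
for start in range(0, len(train_rows), chunk):
    csr = to_csr(train_rows[start:start + chunk])
    models.append(train_online(csr, train_labels[start:start + chunk]))

test_csr = to_csr(test_rows)
hits = sum(
    (ensemble_predict(models, test_csr, i) > 0.5) == bool(label)
    for i, label in enumerate(test_labels)
)
accuracy = hits / len(test_labels)
print("ensemble accuracy on toy data:", accuracy)
```

Each SGD step only touches the columns present in the current row, which is why high cardinality costs little here, and each chunk could just as well be trained on a different machine before averaging.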