Tuning large scale deduplication with reduced effort
Word co-occurrence features for text classification
Information Systems
Proceedings of the 25th International Conference on Scientific and Statistical Database Management - SSDBM
Carlos Alberto Heuser
Wagner Meira Jr.
Thierson Couto
Leonardo Rocha
Fábio Figueiredo
Renata Galante
Discovering health-related knowledge in social media using ensembles of heterogeneous features