Document Classification Process: 7 Pragmatic Approaches For Small Datasets

by neptune.ai Jakub Czakon, 11 min read, May 2nd, 2020

Too Long; Didn't Read

Text classification is one of the predominant tasks in natural language processing, with applications that include news type classification, spam filtering, and toxic comment identification. In most real-life problems the labeled dataset is small, so you have to be deliberate about how you build your machine learning model. In this article, we focus on the “Text Representation” step of the classification pipeline. We use data from the Kaggle competition “Real or Not? NLP with Disaster Tweets” to predict which tweets are about real disasters and which are not.
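
To make "text representation" concrete, here is a minimal sketch of one of the simplest options, a bag-of-words count vector, written with only the Python standard library. It is an illustration under stated assumptions, not necessarily one of the approaches the article covers: the function names (`tokenize`, `build_vocabulary`, `bag_of_words`) are hypothetical, and the two example tweets are made up rather than taken from the competition data.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_vocabulary(documents):
    """Map each distinct token to a column index, in sorted order."""
    tokens = sorted({tok for doc in documents for tok in tokenize(doc)})
    return {tok: i for i, tok in enumerate(tokens)}


def bag_of_words(text, vocabulary):
    """Return a count vector; tokens outside the vocabulary are ignored."""
    counts = Counter(tokenize(text))
    vector = [0] * len(vocabulary)
    for tok, n in counts.items():
        if tok in vocabulary:
            vector[vocabulary[tok]] = n
    return vector


# Made-up example tweets (assumption: not from the Kaggle dataset).
tweets = [
    "Forest fire near the town, residents evacuated",
    "This new album is fire",
]
vocab = build_vocabulary(tweets)
vectors = [bag_of_words(t, vocab) for t in tweets]
```

Both tweets get a nonzero count in the "fire" column even though only one describes a real disaster. That ambiguity is why the choice of representation matters, especially when there are too few labeled examples for a model to learn the surrounding context on its own.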

neptune.ai Jakub Czakon

@neptuneAI_jakub

Senior data scientist building experiment tracking tools for ML projects at https://neptune.ai
