Reuters-21578 text classification with Gensim and Keras

Fork Reuters-21578 is a collection of about 20K news-lines (see reference for more information, downloads and copyright notice), structured using SGML and categorized with 672 labels. They are diveded into five main categories: Topics Places People Organizations Exchanges However, most of them are unused and, looking at the distribution, it’s…

Back to basics!

This is my brand new blog. I always continue updating my original one (Bonaccorso.org), however I’m going to post here articles related to various IT aspects like: Software Engineering, IT Project Management, OOA/OOD/OOP, J2EE, Enterprise and Mobile Architectures and, of course, funny stuff about my job! Photo credit: fundamentals of…