CIKM AnalytiCup 2017: Lazada Product Title Quality Challenge Forum


> Baseline - Logistic Regression

This is a baseline using logistic regression.

We treat the two binary classification tasks (clarity and conciseness) as probability estimation problems.
The features we use are:
- Length of the title (an integer)
- Whether the title contains a number (0 or 1)
We use the same features for both the clarity and conciseness models. These features are quite basic and naive, so feel free to add your own fancier features to improve the models.
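
For anyone who wants to see the idea in a few lines, here is a minimal sketch of the two features and a logistic regression fitted with plain gradient descent. It is not the code from the repository. The function names (`extract_features`, `to_model_input`, `fit_logistic_regression`, `predict_proba`) are hypothetical, the title length is assumed to be counted in characters, "contains a number" is assumed to mean "contains at least one digit", and dividing the length by 100 is an extra assumption added only to keep gradient descent stable. In practice a library implementation such as scikit-learn's `LogisticRegression` would be the more usual choice.

```python
import math
import re


def extract_features(title):
    """Return [title length in characters, 1 if the title has a digit else 0]."""
    return [len(title), 1 if re.search(r"\d", title) else 0]


def to_model_input(title):
    """Scale the length feature (assumption: divide by 100) for stable training."""
    length, has_digit = extract_features(title)
    return [length / 100.0, float(has_digit)]


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict_proba(x, w, b):
    """Estimated probability that the label is 1 for feature vector x."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


def fit_logistic_regression(X, y, lr=0.5, epochs=2000):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n, n_features = len(X), len(X[0])
    w, b = [0.0] * n_features, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_features, 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(xi, w, b) - yi  # gradient of log loss w.r.t. the logit
            for j in range(n_features):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


if __name__ == "__main__":
    # Made-up titles and labels, purely for illustration (not competition data).
    titles = ["Red Cotton Shirt", "Samsung Galaxy S7 32GB",
              "Best Cheap New Hot Sale Phone Case Cover For Phone 6 6s Case Cover"]
    labels = [1, 1, 0]
    X = [to_model_input(t) for t in titles]
    w, b = fit_logistic_regression(X, labels)
    for t, x in zip(titles, X):
        print(t, round(predict_proba(x, w, b), 3))
```

Since both tasks use the same features, the same feature matrix can be fitted twice, once with the clarity labels and once with the conciseness labels, giving two independent models.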

The code is available on GitHub:

Posted by: qttruong @ April 28, 2017, 9:59 a.m.