Recover named features from L1 regularized logistic regression

I have the following pipeline:

sg = Pipeline([('tfidf', TfidfVectorizer()), ('normalize', Normalizer()), ('l1', LogisticRegression(penalty="l1", dual=False))])

and after peforming the fitting, I want to extract the tokens that correnponds to the non-zero weights.

How can I do this?

-------------Problems Reply------------

features = pipeline.named_steps['tfidf'].get_feature_names()
print(features[pipeline.named_steps['l1'].coef_ != 0])

See TfidfTransformer docs, LogisticRegression docs and the unmerged improved pipeline docs here

Category:python Views:2 Time:2016-06-25

Related post

Copyright (C) dskims.com, All Rights Reserved.

processed in 0.245 (s). 11 q(s)