Fake News Classifier

Description

Determine if news articles are Real or Fake.

Predicted Entities

REAL, FAKE.

Live Demo
Open in Colab
Download Copy S3 URI

How to use

documentAssembler = DocumentAssembler()\
.setInputCol("text")\
.setOutputCol("document")
use = UniversalSentenceEncoder.pretrained(lang="en") \
.setInputCols(["document"])\
.setOutputCol("sentence_embeddings")
document_classifier = ClassifierDLModel.pretrained('classifierdl_use_fakenews', 'en') \
.setInputCols(["document", "sentence_embeddings"]) \
.setOutputCol("class")

nlpPipeline = Pipeline(stages=[documentAssembler, use, document_classifier])
light_pipeline = LightPipeline(nlp_pipeline.fit(spark.createDataFrame([['']]).toDF("text")))

annotations = light_pipeline.fullAnnotate('Donald Trump a KGB Spy? 11/02/2016 In today’s video, Christopher Greene of AMTV reports Hillary Clinton')

val documentAssembler = DocumentAssembler()
.setInputCol("text")
.setOutputCol("document")
val use = UniversalSentenceEncoder.pretrained(lang="en")
.setInputCols(Array("document"))
.setOutputCol("sentence_embeddings")
val document_classifier = ClassifierDLModel.pretrained("classifierdl_use_fakenews", "en")
.setInputCols(Array("document", "sentence_embeddings"))
.setOutputCol("class")
val pipeline = new Pipeline().setStages(Array(documentAssembler, use, document_classifier))

val data = Seq("Donald Trump a KGB Spy? 11/02/2016 In today’s video, Christopher Greene of AMTV reports Hillary Clinton").toDF("text")
val result = pipeline.fit(data).transform(data)

import nlu

text = ["""Donald Trump a KGB Spy? 11/02/2016 In today’s video, Christopher Greene of AMTV reports Hillary Clinton"""]
fake_df = nlu.load('classify.fakenews.use').predict(text, output_level='document')
fake_df[["document", "fakenews"]]

Results

+--------------------------------------------------------------------------------------------------------+------------+
|document                                                                                                |class       |
+--------------------------------------------------------------------------------------------------------+------------+
|Donald Trump a KGB Spy? 11/02/2016 In today’s video, Christopher Greene of AMTV reports Hillary Clinton | FAKE       |
+--------------------------------------------------------------------------------------------------------+------------+

Model Information

Model Name	classifierdl_use_fakenews
Model Class	ClassifierDLModel
Spark Compatibility	2.5.3
Spark NLP Compatibility	2.4
License	open source
Edition	public
Input Labels	[document, sentence_embeddings]
Output Labels	[class]
Language	en
Upstream Dependencies	with tfhub_use

Data Source

This model is trained on the fake new classification challenge. https://raw.githubusercontent.com/joolsa/fake_real_news_dataset/master/fake_or_real_news.csv.zip

Benchmarking

precision    recall  f1-score   support

FAKE       0.85      0.90      0.88       931
REAL       0.90      0.85      0.88       962

accuracy                           0.88      1893
macro avg       0.88      0.88      0.88      1893
weighted avg       0.88      0.88      0.88      1893

PREVIOUSEmotion Detection Classifier

NEXTSarcasm Classifier