facqa

https://github.com/baquer/facqa

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
✓
Committers with academic emails
1 of 2 committers (50.0%) from academic institutions
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (14.4%) to scientific vocabulary

Last synced: 11 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: baquer
License: mit
Language: Jupyter Notebook
Default Branch: master
Size: 2.4 MB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created over 3 years ago · Last pushed over 3 years ago

Metadata Files

Readme License Citation

FacQA

Video Demo

FacQA Website

About The Project

FacQA is a Fake News Detection App build using python Flask and Question Answer Generation ML Model
It is a Part of Debunkathon organised by shastra IIT Madras

Working

The completion of the Model/Tool involved several steps in determining the authenticity of an article. The steps are as follows: * Scraping articles: The first step involves collecting articles from various sources on the internet. * Generating Question-Answer pairs: The next step is to generate a Question-Answer pair from the article to be tested for its authenticity. * Generating Answers: For the Questions generated for step 1, we generate answers from the scrapped articles. * Measuring Text Similarity: After the answers have been generated, the text similarity between the answers generated in steps 2 and 3 is measured using BERT. This helps to determine the authenticity of the article by comparing the answers generated with the answers in the article. * Calculating Bleu Score: Along with the BERT score, the Bleu score is also calculated to measure the similarity between the answers generated and the actual answers in the article. * Deploying a Website: The final step is to deploy a website where users can input articles and receive the results of the authenticity check. This makes the tool easily accessible to the general public.

Architecture

Screenshot

Limitations

When scraping data from websites, we found that there is a limit to the number of requests that can be made before the website blocks further requests. This can hinder data collection and slow down the research process.
Deploying websites with pre-trained models can make the website slow and less responsive. This is because these models can be quite large and computationally expensive, which can cause delays in processing and returning results to users.
We also experienced issues with too many dependencies, which can make the project difficult to manage and maintain. Installing and updating multiple packages can be time-consuming and lead to compatibility issues, which can cause errors and bugs in the code.
The approach is limited to text input only.

Future Works

Some of the possible future work that could be done to improve upon the current approach are as follows:
* Predicting news based on the article's URL rather than just the headlines and article text could be a valuable addition to the model. This would involve extracting data from the URL, such as the source or keywords, to gain further insight into the nature of the article and improve prediction accuracy. * As visual media becomes increasingly prominent in news reporting, it is important to be able to distinguish between authentic and fake images. By training a CNN on image datasets, it could be possible to accurately identify images that are being used in a misleading or deceptive way.
These potential areas of future work would involve expanding the scope of the research to include additional data sources and types of analysis. By doing so, we could improve the accuracy and comprehensiveness of our predictions, and contribute to the development of more robust and reliable news analysis models.

Owner

Name: Syed Mohd Gulam Baquer
Login: baquer
Kind: user
Location: New Delhi
Company: IIIT Delhi

Repositories: 3
Profile: https://github.com/baquer

M.Tech CSE @ IIIT Delhi || iOS native|| Backend || ML

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Patil"
  given-names: "Suraj"
title: "Question Generation using transformers"
version: 1.0.0
date-released: 2020-07
publisher: "GitHub"
url: "https://github.com/patil-suraj/question_generation"

GitHub Events

Total

Last Year

Committers

Last synced: about 1 year ago

All Time

Total Commits: 14
Total Committers: 2
Avg Commits per committer: 7.0
Development Distribution Score (DDS): 0.214

Past Year

Commits: 0
Committers: 0
Avg Commits per committer: 0.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
baquer	s**0@i**n	11
SehbanFazili	1****f	3

Committer Domains (Top 20 + Academic)

iiitd.ac.in: 1

Issues and Pull Requests

Last synced: about 1 year ago

All Time

Total issues: 0
Total pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Total issue authors: 0
Total pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

facqa

Science Score: 54.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

FacQA

About The Project

Working

Architecture

Screenshot

Limitations

Future Works

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels