Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
1 of 2 committers (50.0%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.4%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: baquer
- License: mit
- Language: Jupyter Notebook
- Default Branch: master
- Size: 2.4 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
FacQA
About The Project
- FacQA is a Fake News Detection App build using python Flask and Question Answer Generation ML Model
- It is a Part of Debunkathon organised by shastra IIT Madras
Working
The completion of the Model/Tool involved several steps in determining the authenticity of an article. The steps are as follows: * Scraping articles: The first step involves collecting articles from various sources on the internet. * Generating Question-Answer pairs: The next step is to generate a Question-Answer pair from the article to be tested for its authenticity. * Generating Answers: For the Questions generated for step 1, we generate answers from the scrapped articles. * Measuring Text Similarity: After the answers have been generated, the text similarity between the answers generated in steps 2 and 3 is measured using BERT. This helps to determine the authenticity of the article by comparing the answers generated with the answers in the article. * Calculating Bleu Score: Along with the BERT score, the Bleu score is also calculated to measure the similarity between the answers generated and the actual answers in the article. * Deploying a Website: The final step is to deploy a website where users can input articles and receive the results of the authenticity check. This makes the tool easily accessible to the general public.
Architecture

Screenshot

Limitations
- When scraping data from websites, we found that there is a limit to the number of requests that can be made before the website blocks further requests. This can hinder data collection and slow down the research process.
- Deploying websites with pre-trained models can make the website slow and less responsive. This is because these models can be quite large and computationally expensive, which can cause delays in processing and returning results to users.
- We also experienced issues with too many dependencies, which can make the project difficult to manage and maintain. Installing and updating multiple packages can be time-consuming and lead to compatibility issues, which can cause errors and bugs in the code.
- The approach is limited to text input only.
Future Works
Some of the possible future work that could be done to improve upon the current approach are as follows: * Predicting news based on the article's URL rather than just the headlines and article text could be a valuable addition to the model. This would involve extracting data from the URL, such as the source or keywords, to gain further insight into the nature of the article and improve prediction accuracy. * As visual media becomes increasingly prominent in news reporting, it is important to be able to distinguish between authentic and fake images. By training a CNN on image datasets, it could be possible to accurately identify images that are being used in a misleading or deceptive way. These potential areas of future work would involve expanding the scope of the research to include additional data sources and types of analysis. By doing so, we could improve the accuracy and comprehensiveness of our predictions, and contribute to the development of more robust and reliable news analysis models.
Owner
- Name: Syed Mohd Gulam Baquer
- Login: baquer
- Kind: user
- Location: New Delhi
- Company: IIIT Delhi
- Repositories: 3
- Profile: https://github.com/baquer
M.Tech CSE @ IIIT Delhi || iOS native|| Backend || ML
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Patil" given-names: "Suraj" title: "Question Generation using transformers" version: 1.0.0 date-released: 2020-07 publisher: "GitHub" url: "https://github.com/patil-suraj/question_generation"
GitHub Events
Total
Last Year
Committers
Last synced: 9 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| baquer | s****0@i****n | 11 |
| SehbanFazili | 1****f | 3 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 9 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0