https://github.com/ahmetnasri/project-nlp

https://github.com/ahmetnasri/project-nlp

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.4%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: Ahmetnasri
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 109 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 1 year ago · Last pushed about 1 year ago
Metadata Files
Readme

readme.md

NLP Project – DLBAIPNLP01

This repository contains starter code and project templates for the Natural Language Processing course (DLBAIPNLP01) at IU.

📌 Project Options

Choose one of the following NLP projects:


🧠 Task 1: Sentiment Analysis on Movie Reviews

Goal: Classify movie reviews as positive or negative using NLP and machine learning.

Steps: - Collect movie reviews (e.g., Stanford Sentiment Treebank) - Preprocess text (stopword removal, lemmatization/stemming) - Vectorize text (TF-IDF, word embeddings, etc.) - Train a classifier (Naive Bayes, SVM, or Deep Learning) - Evaluate and test the model - Predict sentiment for new reviews


📬 Task 2: Text Classification for Spam Detection

Goal: Classify text messages/emails as spam or legitimate.

Steps: - Use a dataset like UCI Spambase or SMS spam collection - Text preprocessing - Feature extraction (TF-IDF, etc.) - Train and evaluate ML models - Classify new messages


📰 Task 3: Text Classification for Topic Modeling

Goal: Assign documents to topics (multi-class classification).

Steps: - Use labeled datasets like 20 Newsgroups or Reuters Corpus - Preprocess and vectorize the text - Apply supervised models (e.g., Logistic Regression, SVM, etc.) - Predict and evaluate topic classification


🧰 Technologies Used

  • Python 3.x
  • pandas, scikit-learn
  • NLTK / spaCy for text preprocessing
  • Jupyter Notebooks
  • Optional: TensorFlow / PyTorch for deep learning

🚀 How to Run

  1. Clone the repository: ```bash git clone https://github.com/Ahmetnasri/Project-NLP/tree/main cd Project-NLP

Owner

  • Name: Ahmet Nasri
  • Login: Ahmetnasri
  • Kind: user
  • Location: Berlin, Germany

AI and Computer Vision Engineer

GitHub Events

Total
  • Push event: 7
  • Create event: 2
Last Year
  • Push event: 7
  • Create event: 2