torbot

Dark Web OSINT Tool

https://github.com/dedsecinside/torbot

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.6%) to scientific vocabulary

Keywords

algorithm crawler dark-web dedsec-inside deepweb go hacking hacktoberfest osint projects psnappz python python-web-crawler python3 security security-tools spider tor tor-network torbot

Keywords from Contributors

distributed interactive sequences cryptocurrencies transformers network-simulation testing-tools observability multi-agents application
Last synced: 6 months ago

Repository

Dark Web OSINT Tool

Basic Info
  • Host: GitHub
  • Owner: DedSecInside
  • License: other
  • Language: Python
  • Default Branch: dev
  • Homepage:
  • Size: 13.7 MB
Statistics
  • Stars: 3,288
  • Watchers: 103
  • Forks: 573
  • Open Issues: 7
  • Releases: 10
Created almost 9 years ago · Last pushed 9 months ago
Metadata Files
Readme Funding License Code of conduct Citation

README.md

                     ████████╗ ██████╗ ██████╗     ██████╗  ██████╗ ████████╗
                     ╚══██╔══╝██╔═══██╗██╔══██╗    ██╔══██╗██╔═████╗╚══██╔══╝
                        ██║   ██║   ██║██████╔╝    ██████╔╝██║██╔██║   ██║
                        ██║   ██║   ██║██╔══██╗    ██╔══██╗████╔╝██║   ██║
                        ██║   ╚██████╔╝██║  ██║    ██████╔╝╚██████╔╝   ██║
                        ╚═╝    ╚═════╝ ╚═╝  ╚═╝    ╚═════╝  ╚═════╝    ╚═╝

                        Open Source Intelligence Tool for the Dark Web

Features

  1. Onion Crawler (.onion)
  2. Returns the page title (or the host name if no title is available) and the address, along with a short description of the site.
  3. Save links to a database (Not done)
  4. Output the HTML from a site or save it to an HTML file. (Not done)
  5. Save the link tree as a JSON file.
  6. Crawl custom domains
  7. Check if the link is live
  8. Built-in Updater
  9. Build a visual tree of link relationships that can be quickly viewed or saved to a file

...(will be updated)
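
Features 2 and 5 can be illustrated with a minimal, standard-library-only sketch. It is not TorBot's actual implementation: the names `page_label` and `link_tree_to_json`, and the nested-dictionary shape of the link tree, are assumptions made for illustration.

```python
import json
from html.parser import HTMLParser
from urllib.parse import urlparse


class _TitleParser(HTMLParser):
    """Collects the text inside the first <title> element."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self._done = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title" and not self._done:
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._done = True

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def page_label(html, url):
    """Return the page title, or the host name when no title is present."""
    parser = _TitleParser()
    parser.feed(html)
    title = parser.title.strip()
    return title or urlparse(url).hostname


def link_tree_to_json(tree):
    """Serialize a nested {url: {child_url: {...}}} mapping as JSON text."""
    return json.dumps(tree, indent=2, sort_keys=True)
```

Here `page_label` falls back to the host name only when the title is missing or blank, and `link_tree_to_json` returns text that can be written to a file as-is.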

Dependencies

  • Tor (Optional)
  • Python ^3.9
  • Poetry (Optional)

Python Dependencies

(see pyproject.toml or requirements.txt for more details)

Installation

TorBot

Using venv

  • If using Python ^3.4:

```sh
python -m venv torbot_venv
source torbot_venv/bin/activate
pip install -r requirements.txt
pip install -e .
./main.py --help
```

Using docker

```sh
docker build -t {image_name} .
```

Running without Tor

```sh
docker run {image_name} poetry run python torbot -u https://example.com --depth 2 --visualize tree --save json --disable-socks5
```

Running with Tor

```sh
docker run --network="host" {image_name} poetry run python torbot -u https://example.com --depth 2 --visualize tree --save json
```

Options

usage: Gather and analyze data from Tor sites.

optional arguments:
  -u URL, --url URL     Specify a website link to crawl
  --depth DEPTH         Specify max depth of crawler (default 1)
  -h, --help            Show this help message and exit
  --host                Set IP address for SOCKS5 proxy (defaults to 127.0.0.1)
  --port                Set port for SOCKS5 proxy (defaults to 9050)
  -v                    Displays DEBUG level logging, default is INFO
  --version             Show the current version of TorBot.
  --update              Update TorBot to the latest stable version
  -q, --quiet           Prevents display of header and IP address
  --save FORMAT         Save results in a file. (tree, JSON)
  --visualize FORMAT    Visualizes tree of data gathered. (tree, JSON, table)
  -i, --info            Info displays basic info of the scanned site
  --disable-socks5      Executes HTTP requests without using SOCKS5 proxy
  • NOTE: -u is mandatory for crawling

Read more about torrc here: Torrc
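
To show how the proxy-related options above might fit together, here is a hedged sketch that parses a subset of them and turns `--host`, `--port`, and `--disable-socks5` into a `requests`-style proxies mapping. The functions `build_parser` and `build_proxies` are hypothetical and are not taken from TorBot's source; the `socks5h` scheme is an assumption chosen so that host names are resolved by the proxy.

```python
import argparse


def build_parser():
    """Subset of the options listed above; not TorBot's actual parser."""
    parser = argparse.ArgumentParser(
        description="Gather and analyze data from Tor sites.")
    parser.add_argument("-u", "--url", required=True,
                        help="Website link to crawl")
    parser.add_argument("--depth", type=int, default=1,
                        help="Max depth of crawler")
    parser.add_argument("--host", default="127.0.0.1",
                        help="IP address for SOCKS5 proxy")
    parser.add_argument("--port", type=int, default=9050,
                        help="Port for SOCKS5 proxy")
    parser.add_argument("--disable-socks5", action="store_true",
                        help="Send requests without the SOCKS5 proxy")
    return parser


def build_proxies(args):
    """Return a requests-style proxies mapping, or None when SOCKS5 is disabled."""
    if args.disable_socks5:
        return None
    # socks5h asks the proxy to resolve host names, which .onion addresses need.
    proxy = f"socks5h://{args.host}:{args.port}"
    return {"http": proxy, "https": proxy}
```

The resulting mapping could then be passed as the `proxies` argument of `requests.get`, which needs PySocks installed for SOCKS support.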

Curated Features

  • [x] Visualization Module Revamp
  • [x] Implement BFS Search for webcrawler
  • [x] Improve stability (Handle errors gracefully, expand test coverage, etc.)
  • [x] Increase test coverage
  • [ ] Save the most recent search results to a database
  • [ ] Randomize Tor Connection (Random Header and Identity)
  • [ ] Keyword/Phrase Search
  • [ ] Social Media Integration
  • [ ] Increase anonymity
  • [ ] Screenshot capture
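
The BFS item in the list above can be sketched as a depth-limited breadth-first traversal. This is an illustration under stated assumptions, not the project's crawler: `bfs_crawl`, `get_links`, and the toy link graph are hypothetical, and treating the depth limit as "hops from the root" is one possible reading of the `--depth` option.

```python
from collections import deque


def bfs_crawl(root, get_links, max_depth=1):
    """Visit links breadth-first, up to max_depth hops from root.

    get_links is a hypothetical callable returning the URLs found on a page.
    Returns a list of (url, depth) pairs in visit order.
    """
    visited = {root}
    order = []
    queue = deque([(root, 0)])
    while queue:
        url, depth = queue.popleft()
        order.append((url, depth))
        if depth == max_depth:
            continue  # do not expand pages at the depth limit
        for link in get_links(url):
            if link not in visited:
                visited.add(link)
                queue.append((link, depth + 1))
    return order


# Toy in-memory link graph standing in for real HTTP fetches.
GRAPH = {
    "a.onion": ["b.onion", "c.onion"],
    "b.onion": ["d.onion", "a.onion"],
    "c.onion": ["d.onion"],
    "d.onion": [],
}


def toy_links(url):
    """Look up outgoing links in the toy graph."""
    return GRAPH.get(url, [])
```

Calling `bfs_crawl("a.onion", toy_links, max_depth=2)` visits the root first, then every page one hop away, then pages two hops away. The `visited` set keeps cyclic links from being crawled twice.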

Contribution Guidelines

Found an issue?

If you run into any issues with the project, please let us know by opening a new issue.

Developer Guidelines

We welcome contributions to this project! Here are a few guidelines to follow:

  1. Fork the repository and create a new branch for your contribution.
  2. Make sure your code passes all tests by running pytest before submitting a pull request to the dev branch.
  3. Follow the PEP8 style guide for Python code.
  4. Make sure to add appropriate documentation for any new features or changes.
  5. When submitting a pull request, please provide a detailed description of the changes made.

Maintainers

All Thanks to Our Contributors

License

GNU Public License

Owner

  • Name: DedSec Inside
  • Login: DedSecInside
  • Kind: organization
  • Email: thepsnarayanan@gmail.com

Citation (CITATION.cff)

# @InProceedings{10.1007/978-981-15-0146-3_19,
# author="Narayanan, P. S.
# and Ani, R.
# and King, Akeem T. L.",
# editor="Ranganathan, G.
# and Chen, Joy
# and Rocha, {\'A}lvaro",
# title="TorBot: Open Source Intelligence Tool for Dark Web",
# booktitle="Inventive Communication and Computational Technologies",
# year="2020",
# publisher="Springer Singapore",
# address="Singapore",
# pages="187--195",
# abstract="The dark web has turned into a dominant source of illegal activities. With several volunteered networks, it is      becoming more difficult to track down these services. Open source intelligence (OSINT) is a technique used to gather intelligence on targets by harvesting publicly available data. Performing OSINT on the Tor network makes it a challenge for both researchers and developers because of the complexity and anonymity of the network. This paper presents a tool which shows OSINT in the dark web. With the use of this tool, researchers and Law Enforcement Agencies can automate their task of crawling and identifying different services in the Tor network. This tool has several features which can help extract different intelligence.",
# isbn="978-981-15-0146-3"
# }

cff-version: 1.2.0
message: "If you use this software, please cite the following paper:"
authors:
  - family-names: Narayanan
    given-names: P. S.
    affiliation: Department of Computer Science and Applications, Amrita Vishwa Vidyapeetham, Amritapuri, Kollam, India
  - family-names: King
    given-names: Akeem T. L.
    affiliation: USPA Technologies
  - family-names: Ani
    given-names: R.
    affiliation: Department of Computer Science and Applications, Amrita Vishwa Vidyapeetham, Amritapuri, Kollam, India
keywords:
  - tor
  - research
  - osint
identifiers:
  - type: doi
    value: 10.1007/978-981-15-0146-3_19
license: GNU Public License
repository-code: https://github.com/DedSecInside/TorBot
title: TorBot - Open Source Intelligence Tool for Dark Web
date-released: 2020-01-30

GitHub Events

Total
  • Create event: 4
  • Commit comment event: 1
  • Issues event: 6
  • Watch event: 471
  • Delete event: 2
  • Issue comment event: 15
  • Push event: 4
  • Pull request event: 8
  • Pull request review event: 3
  • Fork event: 82
Last Year
  • Create event: 4
  • Commit comment event: 1
  • Issues event: 6
  • Watch event: 471
  • Delete event: 2
  • Issue comment event: 15
  • Push event: 4
  • Pull request event: 8
  • Pull request review event: 3
  • Fork event: 82

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 714
  • Total Committers: 45
  • Avg Commits per committer: 15.867
  • Development Distribution Score (DDS): 0.543
Past Year
  • Commits: 4
  • Committers: 2
  • Avg Commits per committer: 2.0
  • Development Distribution Score (DDS): 0.5
Top Committers
| Name | Email | Commits |
| --- | --- | --- |
| Akeem King | a****g@g****m | 326 |
| PS Narayanan | t****n@g****m | 207 |
| dependabot[bot] | 4****] | 29 |
| Pravin Agre | a****n@g****m | 20 |
| Akeem Tyrell Lamont King | a****g@M****l | 11 |
| Anthony Suárez | n****0@g****m | 11 |
| Sooraj | s****s@g****m | 9 |
| masterugwee | v****0@g****m | 9 |
| PSN | p****n@l****n | 6 |
| Christian L Moreno | c****m@g****m | 6 |
| Manfredi | m****a@g****m | 5 |
| shivankar-madaan | s****n@g****m | 5 |
| shaswat-satyam | 7****m | 5 |
| Tiago Matias | t****2@g****m | 5 |
| Hossam Al-Dokkani | h****o@t****m | 4 |
| Yash Mehta | y****a | 4 |
| Vedant Borkar | v****4@g****m | 4 |
| SasikaSankalana | 7****a | 4 |
| Robert | r****6@y****m | 4 |
| fukusuket | 4****t | 3 |
| Manfredi Martorana | m****i@g****m | 3 |
| PS NaraYanan | p****n@P****l | 3 |
| Artifact | a****t@l****n | 3 |
| root | r****t@l****n | 2 |
| pUrGe12 | h****8@g****m | 2 |
| Rahul Kumar | 1****0 | 2 |
| Olga Chebotaryova | o****a@g****m | 2 |
| Jack Moulson | j****7@g****m | 2 |
| Kishan Mehta | k****n@s****m | 2 |
| Gus | a****y@g****m | 1 |
and 15 more...
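
The all-time figures above can be cross-checked with a little arithmetic. The sketch below assumes that the Development Distribution Score is defined as one minus the share of commits made by the most active committer. The page does not state that definition, but the reported numbers are consistent with it. The function name is hypothetical.

```python
def development_distribution_score(top_committer_commits, total_commits):
    """1 minus the share of commits made by the most active committer."""
    return 1 - top_committer_commits / total_commits


total_commits = 714
total_committers = 45
top_commits = 326  # first row of the committer list above

# Average commits per committer, rounded as on the page.
avg = round(total_commits / total_committers, 3)

# All-time DDS under the assumed definition.
dds = round(development_distribution_score(top_commits, total_commits), 3)

print(avg, dds)
```

Under the same assumption, the past-year score of 0.5 corresponds to one committer making 2 of the 4 commits.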

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 61
  • Total pull requests: 118
  • Average time to close issues: 6 months
  • Average time to close pull requests: about 1 month
  • Total issue authors: 36
  • Total pull request authors: 20
  • Average comments per issue: 3.8
  • Average comments per pull request: 0.77
  • Merged pull requests: 94
  • Bot issues: 0
  • Bot pull requests: 38
Past Year
  • Issues: 4
  • Pull requests: 5
  • Average time to close issues: 15 days
  • Average time to close pull requests: 4 days
  • Issue authors: 3
  • Pull request authors: 4
  • Average comments per issue: 0.0
  • Average comments per pull request: 1.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 2
Top Authors
Issue Authors
  • KingAkeem (16)
  • advanced4 (4)
  • PSNAppz (3)
  • BlAd373 (3)
  • pillocx (2)
  • Balloon666 (2)
  • Idanban12 (2)
  • Fankaren (1)
  • Shaxrezad (1)
  • datalex42 (1)
  • luke2103 (1)
  • 0x0-art (1)
  • anoduck (1)
  • AbdulSami455 (1)
  • johala09 (1)
Pull Request Authors
  • KingAkeem (47)
  • dependabot[bot] (41)
  • PSNAppz (12)
  • fukusuket (2)
  • harshhere905 (2)
  • vedant-z (2)
  • Blacksujit (2)
  • shaswat-satyam (1)
  • rahul0x00 (1)
  • SasikaSankalana (1)
  • mohitd404 (1)
  • thirdbyte (1)
  • NeoLight1010 (1)
  • mirusu400 (1)
  • stefins (1)
Top Labels
Issue Labels
Hacktoberfest (17), Enhancement (11), Help Wanted (10), Good First Issue (10), Bug (8), New Feature (6), Med Priority (6), Issue Fixed (5), Low Priority (4), Ongoing (4), Idea (3), Performance (3), Unit test (2), Major Upgrade (2), Critical (2), Highly Anticipated Feature (2), Dependencies (1), Python (1), Fixed (1)
Pull Request Labels
Dependencies (44), Python (22), Javascript (19), Enhancement (13), To Review (8), Ready To Merge (7), Issue Fixed (7), New Feature (6), Hacktoberfest (6), Med Priority (5), Ongoing (4), Major Upgrade (4), Doc (3), Merge Approved (3), Performance (2), Machine Learning (2), Readme (2), Low Priority (1), Critical (1), Security Patch (1), Bug (1), Trival Fix (1), Good First Issue (1), Breaking Change (1), Highly Anticipated Feature (1), Unit test (1)

Packages

  • Total packages: 2
  • Total downloads: unknown
  • Total dependent packages: 0
    (may contain duplicates)
  • Total dependent repositories: 0
    (may contain duplicates)
  • Total versions: 16
proxy.golang.org: github.com/dedsecinside/torbot
  • Versions: 8
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 7.0%
Average: 8.2%
Dependent repos count: 9.3%
Last synced: 6 months ago
proxy.golang.org: github.com/DedSecInside/TorBot
  • Versions: 8
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 7.0%
Average: 8.2%
Dependent repos count: 9.3%
Last synced: 6 months ago

Dependencies

docker/Dockerfile docker
  • python 3.9 build
poetry.lock pypi
  • altgraph 0.17.2
  • beautifulsoup4 4.11.1
  • certifi 2022.12.7
  • charset-normalizer 2.0.12
  • decorator 5.1.1
  • future 0.18.2
  • idna 3.3
  • igraph 0.9.11
  • joblib 1.2.0
  • macholib 1.16
  • numpy 1.22.4
  • pefile 2022.5.30
  • progress 1.6
  • pyinstaller 5.1
  • pyinstaller-hooks-contrib 2022.7
  • pysocks 1.7.1
  • python-dotenv 0.20.0
  • pywin32-ctypes 0.2.0
  • requests 2.28.0
  • requests-mock 1.9.3
  • scikit-learn 1.1.2
  • scipy 1.9.1
  • setuptools 65.6.3
  • six 1.16.0
  • sklearn 0.0
  • soupsieve 2.3.2.post1
  • termcolor 1.1.0
  • texttable 1.6.4
  • threadpoolctl 3.1.0
  • threadsafe 1.0.0
  • treelib 1.6.1
  • urllib3 1.26.9
  • validators 0.20.0
  • yattag 1.14.0
pyproject.toml pypi
  • PySocks 1.7.1
  • altgraph 0.17.2
  • beautifulsoup4 4.11.1
  • certifi 2022.12.7
  • charset-normalizer 2.0.12
  • decorator 5.1.1
  • idna 3.3
  • igraph 0.9.11
  • joblib 1.2.0
  • macholib 1.16
  • numpy 1.22.4
  • progress 1.6
  • pyinstaller 5.1
  • pyinstaller-hooks-contrib 2022.7
  • python >=3.8,<3.11
  • python-dotenv 0.20.0
  • requests 2.28.0
  • requests-mock 1.9.3
  • scikit-learn 1.1.2
  • scipy 1.9.1
  • six 1.16.0
  • sklearn 0.0
  • soupsieve 2.3.2.post1
  • termcolor 1.1.0
  • texttable 1.6.4
  • threadpoolctl 3.1.0
  • threadsafe 1.0.0
  • treelib ^1.6.1
  • urllib3 1.26.9
  • validators 0.20.0
  • yattag 1.14.0
requirements.txt pypi
  • PySocks ==1.7.1
  • altgraph ==0.17.2
  • beautifulsoup4 ==4.11.1
  • certifi ==2022.12.7
  • charset-normalizer ==2.0.12
  • decorator ==5.1.1
  • ete3 ==3.1.2
  • idna ==3.3
  • igraph ==0.9.11
  • joblib ==1.2.0
  • macholib ==1.16
  • numpy ==1.22.4
  • progress ==1.6
  • pyinstaller ==5.1
  • pyinstaller-hooks-contrib ==2022.7
  • python-dotenv ==0.20.0
  • requests ==2.28.0
  • requests-mock ==1.9.3
  • scikit-learn ==1.1.2
  • scipy ==1.9.1
  • six ==1.16.0
  • sklearn ==0.0
  • soupsieve ==2.3.2.post1
  • termcolor ==1.1.0
  • texttable ==1.6.4
  • threadpoolctl ==3.1.0
  • threadsafe ==1.0.0
  • urllib3 ==1.26.9
  • validators ==0.20.0
  • yattag ==1.14.0
.github/workflows/flake8.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v3 composite
.github/workflows/pylint.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite