https://github.com/trane-dev/trane
An open source python library for automated prediction engineering
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.9%) to scientific vocabulary
Keywords
Repository
An open source python library for automated prediction engineering
Basic Info
- Host: GitHub
- Owner: trane-dev
- License: mit
- Language: Python
- Default Branch: main
- Homepage: https://www.trane.dev
- Size: 29.4 MB
Statistics
- Stars: 45
- Watchers: 11
- Forks: 16
- Open Issues: 26
- Releases: 8
Topics
Metadata Files
README.md
Trane is a software package that automatically generates problems for temporal datasets and produces labels for supervised learning. Its goal is to streamline the machine learning problem-solving process.
Install
Install Trane using pip:
shell
python -m pip install trane
Usage
Here's a quick demonstration of Trane in action:
```python import trane
data, metadata = trane.loadairbnb() problemgenerator = trane.ProblemGenerator( metadata=metadata, entitycolumns=["location"] ) problems = problemgenerator.generate()
for problem in problems[:5]: print(problem) ```
A few of the generated problems:
```
Generated 40 total problems
Classification problems: 5
Regression problems: 35
For each
With Trane's LLM add-on (pip install "trane[llm]"), we can determine the relevant problems with OpenAI:
```python
from trane.llm import analyze
instructions = "determine 5 most relevant problems about user's booking preferences. Do not include 'predict the first/last X' problems"
context = "Airbnb data listings in major cities, including information about hosts, pricing, location, and room type, along with over 5 million historical reviews."
relevantproblems = analyze(
problems=problems,
instructions=instructions,
context=context,
model="gpt-3.5-turbo-16k"
)
for problem in relevantproblems:
print(problem)
print(f'Reasoning: {problem.get_reasoning()}\n')
Output
text
For each
For each
For each
For each
For each
Community
- Questions or Issues? Create a GitHub issue.
- Want to Chat? Join our Slack community.
Cite Trane
If you find Trane beneficial, consider citing our paper:
Ben Schreck, Kalyan Veeramachaneni. What Would a Data Scientist Ask? Automatically Formulating and Solving Predictive Problems. IEEE DSAA 2016, 440-451.
BibTeX entry:
bibtex
@inproceedings{schreck2016would,
title={What Would a Data Scientist Ask? Automatically Formulating and Solving Predictive Problems},
author={Schreck, Benjamin and Veeramachaneni, Kalyan},
booktitle={Data Science and Advanced Analytics (DSAA), 2016 IEEE International Conference on},
pages={440--451},
year={2016},
organization={IEEE}
}
Owner
- Name: Trane
- Login: trane-dev
- Kind: organization
- Repositories: 1
- Profile: https://github.com/trane-dev
GitHub Events
Total
- Watch event: 1
- Delete event: 16
- Push event: 30
- Pull request event: 31
- Create event: 17
Last Year
- Watch event: 1
- Delete event: 16
- Push event: 30
- Pull request event: 31
- Create event: 17
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 0
- Total pull requests: 6
- Average time to close issues: N/A
- Average time to close pull requests: less than a minute
- Total issue authors: 0
- Total pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 6
- Average time to close issues: N/A
- Average time to close pull requests: less than a minute
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- gsheni (1)
Pull Request Authors
- Trane-bot (47)
- gsheni (5)
- dependabot[bot] (1)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- pypi 71 last-month
- Total dependent packages: 0
- Total dependent repositories: 1
- Total versions: 9
- Total maintainers: 2
pypi.org: trane
automatically generate prediction problems and labels for supervised learning.
- Documentation: https://trane.readthedocs.io/
- License: MIT License
-
Latest release: 0.8.0
published about 2 years ago
Rankings
Maintainers (2)
Dependencies
- actions/checkout v3 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- pypa/gh-action-pypi-publish release/v1 composite
- actions/cache v3 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- composeml ==0.10.1 test
- contourpy ==1.0.6 test
- dill ==0.3.6 test
- featuretools ==1.20.0 test
- matplotlib ==3.6.2 test
- numpy ==1.24.1 test
- pandas ==1.5.2 test
- py ==1.11.0 test
- pyparsing ==3.0.9 test
- pytest ==7.2.0 test
- pytest-cov ==4.0.0 test
- pytest-runner ==6.0.0 test
- pytest-xdist ==3.1.0 test
- python-dateutil ==2.8.2 test
- pytz ==2022.7 test
- scikit-learn ==1.2.0 test
- scikit-plot ==0.3.7 test
- scipy ==1.10.0 test
- tornado ==6.2 test
- tqdm ==4.64.1 test
- actions/checkout v3 composite
- actions/setup-python v4 composite
- peter-evans/create-pull-request v4 composite
- peter-evans/enable-pull-request-automerge v3 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- importlib_resources >= 6.0.0
- numpy >= 1.24.2
- pandas >= 2.0.1
- pyarrow >= 11.0.0
- scipy >= 1.10.0
- tqdm >= 4.65.0
- importlib_resources ==6.0.0 test
- numpy ==1.24.2 test
- pandas ==2.0.0 test
- pyarrow ==11.0.0 test
- scipy ==1.10.0 test
- tqdm ==4.65.0 test