vulntrain
A tool to generate datasets and models based on vulnerabilities descriptions from @Vulnerability-Lookup.
https://github.com/deepset-ai/haystack-tutorials
Here you can find all the Tutorials for Haystack 📓
https://github.com/asyml/texar-pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
https://github.com/csinva/gpt-paper-title-generator
Generating paper titles (and more!) with GPT trained on data scraped from arXiv.
https://github.com/davidpomerenke/scigen.js
Create scientific papers on the fly: SciGen brought to the browser.
https://github.com/daniel-furman/sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
https://github.com/andstor/verified-smart-contracts
:page_facing_up: Verified Ethereum Smart Contract dataset
https://github.com/andstor/verified-smart-contracts-audit
:bug: Verified smart contract dataset with vulnerability labeling
modular-diffusion
Python library for designing and training your own Diffusion Models with PyTorch
https://github.com/apn-pucky/metamorph
Metamorph generates new text through repeated translation
gpt
Indulging the idea that my prompts to the topical hell robot might be original enough to document.
minimal-text-diffusion
A minimal implementation of diffusion models for text generation
postmodern_generator
This repository contains the source code for The Postmodern Generator, a tool designed to generate text that mimics the style of academic postmodern criticism. It offers customizable and extensible text generation features without relying on large language models.
https://github.com/alexeyev/keras-generating-sentences-from-a-continuous-space
Text Variational Autoencoder inspired by the paper 'Generating Sentences from a Continuous Space' Bowman et al. https://arxiv.org/abs/1511.06349
https://github.com/camel-lab/gender-rewriting
Code, models, and data for "User-Centric Gender Rewriting". NAACL 2022.
https://github.com/amazon-science/text_generation_diffusion_llm_topic
Topic Embedding, Text Generation and Modeling using diffusion
https://github.com/amazon-science/bold
Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper
https://github.com/awslabs/gap-text2sql
GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training