Updated 6 months ago

textgen • Rank 12.8 • Science 64%

TextGen: Implementation of text generation models, including LLaMA, BLOOM, GPT2, BART, T5, SongNet, and so on. Implements training and prediction for models including LLaMA, ChatGLM, BLOOM, GPT2, Seq2Seq, BART, T5, and UDA, ready to use out of the box.

Updated 6 months ago

vulntrain • Rank 7.6 • Science 67%

A tool to generate datasets and models based on vulnerability descriptions from @Vulnerability-Lookup.

Updated 6 months ago

https://github.com/asyml/texar-pytorch • Rank 14.5 • Science 20%

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/

Updated 6 months ago

https://github.com/daniel-furman/sft-demos • Rank 4.4 • Science 13%

Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.

Updated 6 months ago

https://github.com/apn-pucky/metamorph • Science 26%

Metamorph generates new text through repeated translation

Updated 6 months ago

gpt • Science 44%

Indulging the idea that my prompts to the topical hell robot might be original enough to document.

Updated 6 months ago

polyglot-or-not • Science 49%

Are foundation LMs multilingual knowledge bases? (EMNLP 2023)

Updated 6 months ago

minimal-text-diffusion • Science 54%

A minimal implementation of diffusion models for text generation

Updated 6 months ago

postmodern_generator • Science 44%

This repository contains the source code for The Postmodern Generator, a tool designed to generate text that mimics the style of academic postmodern criticism. It offers customizable and extensible text generation features without relying on large language models.

Updated 6 months ago

https://github.com/alexeyev/keras-generating-sentences-from-a-continuous-space • Science 10%

Text Variational Autoencoder inspired by the paper 'Generating Sentences from a Continuous Space' by Bowman et al. https://arxiv.org/abs/1511.06349

Updated 6 months ago

https://github.com/camel-lab/gender-rewriting • Science 10%

Code, models, and data for "User-Centric Gender Rewriting". NAACL 2022.

Updated 6 months ago

https://github.com/amazon-science/bold • Science 13%

Dataset associated with the paper "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation"

Updated 6 months ago

https://github.com/awslabs/gap-text2sql • Science 10%

GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training