https://github.com/beehive-lab/docker-gpullama3.java

Docker build scripts for running GPULlama3.java on GPUs with TornadoVM

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.9%) to scientific vocabulary
Last synced: 10 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: beehive-lab
  • License: apache-2.0
  • Language: Shell
  • Default Branch: main
  • Size: 11.7 KB
Statistics
  • Stars: 3
  • Watchers: 0
  • Forks: 2
  • Open Issues: 1
  • Releases: 0
Created about 1 year ago · Last pushed about 1 year ago
Metadata Files
Readme License

README.md

Docker Support for GPULlama3.java on NVIDIA GPUs

Run GPULlama3.java on NVIDIA GPUs using Docker with either OpenCL or PTX support.


🛠 Requirements

  • NVIDIA GPU with compatible drivers
  • NVIDIA Container Toolkit installed
    👉 Install Guide
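
The two requirements above can be checked from a shell before pulling anything. The following is a minimal preflight sketch, not part of this repository: `have_cmd` and `preflight` are hypothetical helper names, and the check only confirms that the `docker` and `nvidia-smi` binaries are on `PATH`. It does not prove that the NVIDIA Container Toolkit is configured correctly.

```bash
# Hypothetical preflight helpers (not part of this repository).

# have_cmd NAME: succeed if NAME resolves to a command on PATH.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# preflight: report which of the required tools are present.
# Returns 0 when both are found, 1 otherwise.
preflight() {
  missing=0
  for tool in docker nvidia-smi; do
    if have_cmd "$tool"; then
      echo "found: $tool"
    else
      echo "missing: $tool" >&2
      missing=1
    fi
  done
  return "$missing"
}
```

Calling `preflight` prints one line per tool. A common follow-up check that the GPU is visible inside a container is `docker run --rm --gpus all ubuntu nvidia-smi`, which should print the same device table as running `nvidia-smi` on the host.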

🚀 Quick Start

1. Pull a Docker Image

OpenCL:

    docker pull beehivelab/gpullama3.java-nvidia-openjdk-opencl:latest

PTX:

    docker pull beehivelab/gpullama3.java-nvidia-openjdk-ptx:latest
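
The two image names differ only in the backend suffix, so a script that supports both backends can derive the name instead of hard-coding it. The sketch below assumes that naming pattern holds for exactly these two images; `image_for_backend` is a hypothetical helper, not something the repository provides.

```bash
# image_for_backend BACKEND: print the Docker Hub image for "opencl" or "ptx".
# Hypothetical helper; the image names are the two listed in this README.
image_for_backend() {
  case "$1" in
    opencl|ptx)
      echo "beehivelab/gpullama3.java-nvidia-openjdk-$1:latest"
      ;;
    *)
      echo "unknown backend: $1 (expected opencl or ptx)" >&2
      return 1
      ;;
  esac
}
```

For example, `docker pull "$(image_for_backend ptx)"` pulls the PTX image. Any other backend name is rejected rather than turned into a guessed image name.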


2. Clone the Runner Scripts Repository

    git clone https://github.com/beehive-lab/docker-gpullama3.java
    cd docker-gpullama3.java


3. Download a Model

Download a model from Hugging Face, as shown in GPULlama3.java. For example:

    wget https://huggingface.co/beehive-lab/Llama-3.2-1B-Instruct-GGUF-FP16/resolve/main/beehive-llama-3.2-1b-instruct-fp16.gguf
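
A failed download can leave an HTML error page or an empty file under the model's name. GGUF files begin with the four ASCII bytes `GGUF`, so a quick sanity check is possible before starting a container. The sketch below is an illustration only: `is_gguf` is a hypothetical helper, and it inspects just the first four bytes, so it cannot detect a truncated file.

```bash
# is_gguf FILE: succeed if FILE exists and begins with the 4-byte magic "GGUF".
# Hypothetical helper; checks the header only, not the file's integrity.
is_gguf() {
  [ -f "$1" ] || return 1
  magic=$(head -c 4 "$1" 2>/dev/null)
  [ "$magic" = "GGUF" ]
}
```

For example: `is_gguf beehive-llama-3.2-1b-instruct-fp16.gguf || echo "not a GGUF file" >&2`.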


4. Run a Prompt

OpenCL Runner:

    ./dockerized-llama-tornado-nvidia-opencl \
      --gpu --opencl --verbose-init \
      --model beehive-llama-3.2-1b-instruct-fp16.gguf \
      --prompt "tell me a joke"

PTX Runner:

    ./dockerized-llama-tornado-nvidia-ptx \
      --gpu --ptx --verbose-init \
      --model beehive-llama-3.2-1b-instruct-fp16.gguf \
      --prompt "tell me a joke"

Sample Output: Here's one: What do you call a fake noodle? An impasta!
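
The two runner invocations above differ only in the script suffix and the backend flag. A thin wrapper can make the backend a parameter. This is a sketch under that assumption: `run_prompt` and the `DRY_RUN` variable are hypothetical names introduced here, and the flags are copied unchanged from the commands above.

```bash
# run_prompt BACKEND MODEL PROMPT
# Hypothetical wrapper: picks the runner script for "opencl" or "ptx" and
# passes the same flags as the README commands. With DRY_RUN=1 it prints the
# runner and its arguments, one per line, instead of executing them.
run_prompt() {
  case "$1" in
    opencl|ptx) ;;
    *) echo "unknown backend: $1" >&2; return 1 ;;
  esac
  runner="./dockerized-llama-tornado-nvidia-$1"
  backend_flag="--$1"
  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf '%s\n' "$runner" --gpu "$backend_flag" --verbose-init \
      --model "$2" --prompt "$3"
  else
    "$runner" --gpu "$backend_flag" --verbose-init \
      --model "$2" --prompt "$3"
  fi
}
```

Run from the cloned repository directory, `run_prompt ptx beehive-llama-3.2-1b-instruct-fp16.gguf "tell me a joke"` is equivalent to the PTX command above. Setting `DRY_RUN=1` shows what would be executed without starting a container.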


🧱 Build the GPULlama3.java Docker Images

To build the Docker images locally, use the provided build.sh script:

    ./build.sh

Build Options

Build for NVIDIA GPUs using one of the following flags:

  • --nvidia-jdk21-ocl → Build image with OpenCL support and JDK 21
  • --nvidia-jdk21-ptx → Build image with PTX support and JDK 21

Example:

    ./build.sh --nvidia-jdk21-ocl

This will create a Docker image ready to run GPULlama3.java on NVIDIA GPUs with OpenCL.
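
To build both images in one go, the two flags can be looped over. The sketch below assumes `build.sh` takes one flag per invocation, as in the example above, and that it exits non-zero on failure. `build_all` and the `BUILD_SCRIPT` override are hypothetical names added for illustration.

```bash
# build_all: run the build script once per documented flag, stopping at the
# first failure. BUILD_SCRIPT is a hypothetical override for the script path
# and defaults to ./build.sh.
build_all() {
  for flag in --nvidia-jdk21-ocl --nvidia-jdk21-ptx; do
    "${BUILD_SCRIPT:-./build.sh}" "$flag" || return 1
  done
}
```

Running `BUILD_SCRIPT=echo build_all` prints the two flags without building anything, which is a cheap way to confirm what would be invoked.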


License

This project is developed at The University of Manchester and is open source under the Apache License 2.0.

Owner

  • Name: Beehive lab
  • Login: beehive-lab
  • Kind: organization
  • Location: United Kingdom

Beehive lab is part of the Advanced Processor Technologies Group at the University of Manchester, specializing in hardware/software co-design.

GitHub Events

Total
  • Release event: 1
  • Watch event: 2
  • Push event: 1
  • Create event: 2
Last Year
  • Release event: 1
  • Watch event: 2
  • Push event: 1
  • Create event: 2