https://github.com/connectome-neuprint/neuprinthttp

Implements connectomics REST interface

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (18.8%) to scientific vocabulary

Last synced: 9 months ago · JSON representation

Repository

Implements connectomics REST interface

Basic Info

Host: GitHub
Owner: connectome-neuprint
Language: Go
Default Branch: master
Homepage:
Size: 510 KB

Statistics

Stars: 6
Watchers: 7
Forks: 2
Open Issues: 28
Releases: 5

Created almost 8 years ago · Last pushed 11 months ago

Metadata Files

Readme Support

neuPrintHTTP

Implements a connectomics REST interface that leverages the neuprint data model. neuPrintHTTP can be run in a user authenticated mode or without any authentication. Note: that the authenticated mode (which requires more configuration and setup) is needed to use with neuPrintExplorer web application. The un-authenticated mode is the ideal way to access the neuPrint data programmatically.

Dependencies

Since neuPrint is written in golang, you will need to download and install golang before you can build and run neuPrintHTTP. The build tools for golang are opinionated about the file structure and location of golang projects, but by default the tools will autogenerate the required folders when you go get a project.

Installing

Go must be installed (version 1.16+). neuPrintHTTP supports both file-based logging and Apache Kafka. For basic installation:

Option 1: Clone and build (recommended)

```bash

Clone the repository

git clone https://github.com/connectome-neuprint/neuPrintHTTP.git cd neuPrintHTTP

Build the application

go build

Or install it to your GOPATH's bin directory

go install ```

Option 2: Direct install (requires Go modules)

```bash

Install the latest version

go install github.com/connectome-neuprint/neuPrintHTTP@latest ```

To run tests:

% go test ./...

To test a specific package:

% go test ./api/...

neuprintHTTP uses a python script to support cell type analysis. To use this script, install scipy, scikit-learn, and pandas and make sure to run neuprint HTTP in the top directory where the python script is located.

Data Access Endpoints

Standard JSON Endpoint

The default endpoint for custom queries is /api/custom/custom, which returns results in JSON format:

bash curl -X POST "http://localhost:11000/api/custom/custom" \ -H "Content-Type: application/json" \ -d '{"cypher": "MATCH (n) RETURN n LIMIT 1", "dataset": "hemibrain"}'

The response will be JSON with this structure: json { "columns": ["name", "size"], "data": [["t4", 323131], ["mi1", 232323]] }

Where: - columns: Array of column names from your query - data: Array of rows, each row containing values that correspond to the columns

Apache Arrow Support

neuPrintHTTP supports returning query results in Apache Arrow format via the /api/custom/arrow HTTP endpoint. This provides several advantages:

Efficient binary serialization with low overhead
Preservation of data types
Native integration with data science tools
Optimized memory layout for analytical workloads

neuPrintHTTP uses Arrow v18 for all Arrow-related functionality, including both the HTTP IPC stream format and the preliminary Flight implementation.

Using the Arrow Endpoint

To retrieve data in Arrow format, send a POST request to /api/custom/arrow with the same JSON body format as the regular custom endpoint:

bash curl -X POST "http://localhost:11000/api/custom/arrow" \ -H "Content-Type: application/json" \ -d '{"cypher": "MATCH (n) RETURN n LIMIT 1", "dataset": "hemibrain"}' \ --output data.arrow

The response will be in Arrow IPC stream format with content type application/vnd.apache.arrow.stream. This is a standard way to transfer Arrow data over HTTP without requiring gRPC or Arrow Flight.

You can parse this with Arrow libraries available in multiple languages:

```python

Python example - Standard HTTP with Arrow IPC format (No Flight required)

import os import pyarrow as pa import requests

Get token from environment variable. Token can be found in neuPrintExplorer settings.

Only necessary if authentication is turned on.

token = os.environ.get("NEUPRINTAPPLICATIONCREDENTIALS")

Add the token to the headers

headers = { "Content-Type": "application/json", "Authorization": f"Bearer {token}" }

resp = requests.post('http://localhost:11000/api/custom/arrow', headers=headers, json={"cypher": "MATCH (n) RETURN n LIMIT 1", "dataset": "hemibrain"})

Parse the Arrow IPC stream from the HTTP response

reader = pa.ipc.openstream(pa.pybuffer(resp.content)) table = reader.read_all()

Convert to pandas DataFrame

df = table.to_pandas()

For Neo4j node objects (which are represented as Arrow Maps)

we need a helper function to convert Arrow MapValue objects to Python dictionaries

def convertmapvaluetodict(val): if hasattr(pa.lib, 'MapValue') and isinstance(val, pa.lib.MapValue): return {k.aspy(): v.as_py() for k, v in val.items()} return val

Process Map columns in the DataFrame

for col in df.columns: if hasattr(pa.lib, 'MapValue'): # Use the MapValue approach if available df[col] = df[col].map(lambda x: convertmapvalueto_dict(x) if x is not None else None)

print(df) ```

```javascript // JavaScript example with Arrow JS const response = await fetch('http://localhost:11000/api/custom/arrow', { method: 'POST', headers: {'Content-Type': 'application/json'}, body: JSON.stringify({ cypher: "MATCH (n) RETURN n LIMIT 1", dataset: "hemibrain" }) });

// Get the binary data const arrayBuffer = await response.arrayBuffer(); // Parse the Arrow IPC stream const table = await arrow.tableFromIPC(arrayBuffer); console.log(table.toString()); ```

developers

If modifying the source code and updating the swagger inline comments, update the documentation with:

% go generate

using Apache Kafka for logging

To use Kafka for logging, one must install librdkafka and build neuprint http with the kafka option.

See installation instructions for librdkafka.

And then:

% go install -tags kafka

Installing without kafka support

If you are having trouble building the server, because librdkafka is missing and you don't need to send log messages to a kafka server, then try this build.

%  go get -tags nokafka github.com/connectome-neuprint/neuPrintHTTP

Running

% neuPrintHTTP -port |PORTNUM| config.json

The config file should contain information on the backend datastore that satisfies the connectomics REST API and the location for a file containing a list of authorized users. To test https locally and generate the necessary certificates, run:

% go run $GOROOT/src/crypto/tls/generate_cert.go --host localhost

Command Line Options

bash Usage: neuprintHTTP [OPTIONS] CONFIG.json -port int port to start server (default 11000) -arrow-flight-port int port for Arrow Flight gRPC server (default 11001) -disable-arrow disable Arrow format support (enabled by default) -public_read allow all users read access -proxy-port int proxy port to start server -pid-file string file for pid -verbose verbose mode

Configuration

The server is configured using a JSON file. The configuration specifies database connections, authentication options, and other server settings.

Apache Arrow Configuration

The Arrow support in neuPrintHTTP includes:

Arrow IPC HTTP endpoint: Available at /api/custom/arrow on the main HTTP port
Arrow Flight gRPC server: Runs on a separate port (default: 11001)

To change the Arrow Flight port:

```bash

Start with custom Flight port

neuprintHTTP -arrow-flight-port 12345 config.json ```

To disable Arrow support entirely:

```bash

Disable all Arrow functionality

neuprintHTTP -disable-arrow config.json ```

Sample Configuration

A sample configuration file can be found in config-examples/config.json in this repo:

json { "engine": "neuPrint-bolt", "engine-config": { "server": "<NEO4-SERVER>:7687", "user": "neo4j", "password": "<PASSWORD>" }, "datatypes": { "skeletons": [ { "instance": "<UNIQUE NAME>", "engine": "dvidkv", "engine-config": { "dataset": "hemibrain", "server": "http://<DVIDADDR>", "branch": "<UUID>", "instance": "segmentation_skeletons" } } ] }, "disable-auth": true, "swagger-docs": "<NEUPRINT_HTTP_LOCATION>/swaggerdocs", "log-file": "log.json" }

Note that the Bolt (optimized neo4j protocol) engine neupPrint-bolt is recommended while the older neuPrint-neo4j engine is deprecated. See below.

Neo4j Bolt Driver

neuPrintHTTP now supports the Neo4j Bolt protocol driver, which provides better performance and more accurate handling of large integer values (greater than 53 bits). To use the Bolt driver:

json { "engine": "neuPrint-bolt", "engine-config": { "server": "bolt://localhost:7687", "user": "neo4j", "password": "password", "database": "neo4j" // Optional: database name for Neo4j 4.0+ (omit for Neo4j 3.x) }, "timeout": 600 }

The Bolt driver correctly preserves large integer values (including integers above 2^53) that would be truncated to floating-point by the HTTP JSON API. This is particularly important for precise integer operations on large IDs and counts.

For more detailed configuration options, refer to config/config.go.

No Auth Mode

This is the easiest way to use neuprint http. It launches an http server and does not require user authorization. To use this, just set "disable-auth" to true as above.

Auth mode

There are several options required to use authorization and authentication with Google. Notably, the user must register the application with Google to enable using google authentication. Also, for authoriation one can either specify user information in a static json file (example in this repo) or data can be extracted from Google's cloud datastore with a bit more configuration. See more documentation in config/config.go.

If you're using Google Datastore to manage the list of authorized users, you can use the Google Cloud Console or the Python API. (See below.)

One must also provide https credentials. To get certificates for local testing, run and add the produced files into the config file.

% go run $GOROOT/src/crypto/tls/generate_cert.go --host localhost

Update authorized users list with Google Cloud Console

For adding or removing a single user, it's most convenient to just use the Google Cloud Console.

Start on the "Dashboard" page
Click neuprint_janelia
Click "Query Entities"
Click name=users
Add or delete properties (one per user)
Click the "Save" button at the bottom of the screen.

Update authorized users list with Python

If you're using Google Datastore to manage the list of authorized users, it's convenient to programmatically edit the list with the Google Datastore Python API.

Start by installing the google-cloud-datastore Python package. Also make sure you've got the correct Google Cloud Project selected (or configure GOOGLE_APPLICATION_CREDENTIALS).

conda install -c conda-forge google-cloud-datastore gcloud config set project dvid-em

Here's an example:

```python from google.cloud.datastore import Client, Key, Entity

client = Client()

Fetch the list of users from the appropriate access list

key = client.key('neuprintjanelia', 'users') r = client.query(kind='neuprintjanelia', ancestor=key).fetch(1000)

Extract the "entity", which is dict-like

entity = list(r)[0]

Remove a user

del entity['baduser@gmail.com']

Add some new users

newusers = { 'newuser1@gmail.com': 'readwrite', 'newuser2@gmail.com': 'readwrite' } entity.update(newusers)

Upload

client.put(entity) ```

Owner

Name: NeuPrint
Login: connectome-neuprint
Kind: organization
Email: neuprint@janelia.hhmi.org

Repositories: 19
Profile: https://github.com/connectome-neuprint

GitHub Events

Total

Release event: 6
Watch event: 1
Delete event: 19
Issue comment event: 2
Push event: 36
Pull request event: 36
Create event: 24

Last Year

Release event: 6
Watch event: 1
Delete event: 19
Issue comment event: 2
Push event: 36
Pull request event: 36
Create event: 24

Issues and Pull Requests

Last synced: about 3 years ago

All Time

Total issues: 49
Total pull requests: 13
Average time to close issues: 4 months
Average time to close pull requests: about 8 hours
Total issue authors: 7
Total pull request authors: 5
Average comments per issue: 0.31
Average comments per pull request: 0.08
Merged pull requests: 9
Bot issues: 0
Bot pull requests: 3

Past Year

Issues: 1
Pull requests: 3
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 1
Pull request authors: 1
Average comments per issue: 4.0
Average comments per pull request: 0.0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 3

View more stats

Top Authors

Issue Authors

stephenplaza (25)
nneubarth (9)
neomorphic (8)
stuarteberg (7)
robsv (1)
jakobtroidl (1)
hannahgooden (1)

Pull Request Authors

DocSavage (10)
dependabot[bot] (10)
nneubarth (5)
stephenplaza (3)
jtpdowns (2)
stuarteberg (2)
neomorphic (1)

Top Labels

Issue Labels

enhancement (11) bug (6) to do (1)

Pull Request Labels

dependencies (10) go (4)

Packages

Total packages: 2
Total downloads: unknown

Total dependent packages: 0
(may contain duplicates)
Total dependent repositories: 0
(may contain duplicates)
Total versions: 50

proxy.golang.org: github.com/connectome-neuprint/neuPrintHTTP

Homepage: https://github.com/connectome-neuprint/neuPrintHTTP
Documentation: https://pkg.go.dev/github.com/connectome-neuprint/neuPrintHTTP#section-documentation
Latest release: v1.7.5
published 11 months ago

Versions: 25
Dependent Packages: 0
Dependent Repositories: 0

Rankings

Dependent packages count: 7.0%

Average: 8.2%

Dependent repos count: 9.3%

Last synced: 10 months ago

proxy.golang.org: github.com/connectome-neuprint/neuprinthttp

Homepage: https://github.com/connectome-neuprint/neuprinthttp
Documentation: https://pkg.go.dev/github.com/connectome-neuprint/neuprinthttp#section-documentation
Latest release: v1.7.5
published 11 months ago

Versions: 25
Dependent Packages: 0
Dependent Repositories: 0

Rankings

Dependent packages count: 7.0%

Average: 8.2%

Dependent repos count: 9.3%

Last synced: 10 months ago

Dependencies

go.mod go

cloud.google.com/go/compute v1.6.1
github.com/blang/semver v3.5.1+incompatible
github.com/cespare/xxhash v1.1.0
github.com/cespare/xxhash/v2 v2.1.2
github.com/confluentinc/confluent-kafka-go v1.8.2
github.com/dgraph-io/badger/v3 v3.2103.2
github.com/dgraph-io/ristretto v0.1.0
github.com/dgrijalva/jwt-go v3.2.0+incompatible
github.com/dustin/go-humanize v1.0.0
github.com/gogo/protobuf v1.3.2
github.com/golang-jwt/jwt v3.2.2+incompatible
github.com/golang/glog v1.0.0
github.com/golang/groupcache v0.0.0-20210331224755-41bb18bfe9da
github.com/golang/protobuf v1.5.2
github.com/golang/snappy v0.0.4
github.com/google/flatbuffers v2.0.6+incompatible
github.com/gorilla/context v1.1.1
github.com/gorilla/securecookie v1.1.1
github.com/gorilla/sessions v1.2.1
github.com/janelia-flyem/echo-secure v0.0.0-20220608035251-8811e3d36dfe
github.com/klauspost/compress v1.15.6
github.com/knightjdr/hclust v1.0.2
github.com/labstack/echo-contrib v0.12.0
github.com/labstack/echo/v4 v4.7.2
github.com/labstack/gommon v0.3.1
github.com/mattn/go-colorable v0.1.12
github.com/mattn/go-isatty v0.0.14
github.com/pkg/errors v0.9.1
github.com/satori/go.uuid v1.2.0
github.com/valyala/bytebufferpool v1.0.0
github.com/valyala/fasttemplate v1.2.1
go.opencensus.io v0.23.0
golang.org/x/crypto v0.0.0-20220525230936-793ad666bf5e
golang.org/x/net v0.0.0-20220607020251-c690dde0001d
golang.org/x/oauth2 v0.0.0-20220524215830-622c5d57e401
golang.org/x/sys v0.0.0-20220520151302-bc2c85ada10a
golang.org/x/text v0.3.7
golang.org/x/time v0.0.0-20220411224347-583f2d630306
google.golang.org/appengine v1.6.7
google.golang.org/protobuf v1.28.0
gopkg.in/confluentinc/confluent-kafka-go.v1 v1.8.2

go.sum go

669 dependencies

https://github.com/connectome-neuprint/neuprinthttp

Science Score: 26.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

neuPrintHTTP

Dependencies

Installing

Option 1: Clone and build (recommended)

Clone the repository

Build the application

Or install it to your GOPATH's bin directory

Option 2: Direct install (requires Go modules)

Install the latest version

Data Access Endpoints

Standard JSON Endpoint

Apache Arrow Support

Using the Arrow Endpoint

Python example - Standard HTTP with Arrow IPC format (No Flight required)

Get token from environment variable. Token can be found in neuPrintExplorer settings.

Only necessary if authentication is turned on.

Add the token to the headers

Parse the Arrow IPC stream from the HTTP response

Convert to pandas DataFrame

For Neo4j node objects (which are represented as Arrow Maps)

we need a helper function to convert Arrow MapValue objects to Python dictionaries

Process Map columns in the DataFrame

developers

using Apache Kafka for logging

Installing without kafka support

Running

Command Line Options

Configuration

Apache Arrow Configuration

Start with custom Flight port

Disable all Arrow functionality

Sample Configuration

Neo4j Bolt Driver

No Auth Mode

Auth mode

Update authorized users list with Google Cloud Console

Update authorized users list with Python

Fetch the list of users from the appropriate access list

Extract the "entity", which is dict-like

Remove a user

Add some new users

Upload

Owner

GitHub Events

Total

Last Year

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

proxy.golang.org: github.com/connectome-neuprint/neuPrintHTTP

Rankings

proxy.golang.org: github.com/connectome-neuprint/neuprinthttp

Rankings

Dependencies