aws-parallelcluster

AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud.

https://github.com/aws/aws-parallelcluster

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    2 of 119 committers (1.7%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.7%) to scientific vocabulary

Keywords from Contributors

autograd build-tools distribution tensor spack radiuss hpsf mesh interpretability standardization
Last synced: 10 months ago

Repository


Basic Info
Statistics
  • Stars: 873
  • Watchers: 86
  • Forks: 313
  • Open Issues: 178
  • Releases: 95
Created about 12 years ago · Last pushed 10 months ago
Metadata Files
Readme Changelog Contributing License Code of conduct Codeowners Notice

README.md

AWS ParallelCluster - HPC for the Cloud


AWS ParallelCluster is an AWS supported Open Source cluster management tool that makes it easy for you to deploy and manage High Performance Computing (HPC) clusters in the AWS cloud. Built on the Open Source CfnCluster project, AWS ParallelCluster enables you to quickly build an HPC compute environment in AWS. It automatically sets up the required compute resources and a shared filesystem and offers a variety of batch schedulers such as AWS Batch and Slurm. AWS ParallelCluster facilitates both quick-start proofs of concept (POCs) and production deployments. You can build higher level workflows, such as a Genomics portal that automates the entire DNA sequencing workflow, on top of AWS ParallelCluster.

Quick Start

IMPORTANT: you will need an Amazon EC2 Key Pair to be able to complete the following steps. Please see the Official AWS Guide.

First, prepare a Python virtual environment for ParallelCluster. Note that ParallelCluster >= 3.0.0 requires Python >= 3.7.

$ python3 -m pip install --upgrade pip
$ python3 -m pip install --user --upgrade virtualenv
$ python3 -m virtualenv ~/hpc-ve
$ source ~/hpc-ve/bin/activate
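Before installing, it can help to confirm that the interpreter inside the virtual environment meets the Python >= 3.7 requirement noted above. The following is a minimal sketch; the function name `python_is_supported` is hypothetical and not part of ParallelCluster.

```python
import sys

# Minimum interpreter version stated in the Quick Start for ParallelCluster >= 3.0.0.
MIN_PYTHON = (3, 7)


def python_is_supported(version_info=None):
    """Return True when the given (or running) interpreter meets the minimum."""
    if version_info is None:
        version_info = sys.version_info
    # Compare only the (major, minor) pair against the documented minimum.
    return tuple(version_info[:2]) >= MIN_PYTHON


if __name__ == "__main__":
    if not python_is_supported():
        sys.exit("Python %d.%d or newer is required" % MIN_PYTHON)
    print("Python version OK")
```

Run it with the `python3` from the activated environment so that the check applies to the interpreter pip will install into.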

Make sure you have installed the AWS Command Line Interface:

$ pip3 install awscli

Node.js is required by the AWS CDK library used by ParallelCluster. Please refer to the installation instructions in the AWS CDK documentation.

Then you can install AWS ParallelCluster:

$ pip3 install aws-parallelcluster

Next, configure your AWS credentials and default region:

$ aws configure
AWS Access Key ID [None]: YOUR_KEY
AWS Secret Access Key [None]: YOUR_SECRET
Default region name [us-east-1]:
Default output format [None]:

Then, run pcluster configure. A list of valid options will be displayed for each configuration parameter. Type an option number and press Enter to select a specific option, or just press Enter to accept the default option.

```
$ pcluster configure --config /dir/cluster-config.yaml
INFO: Configuration file /dir/cluster-config.yaml will be written.
Press CTRL-C to interrupt the procedure.

Allowed values for AWS Region ID:
1. eu-north-1
...
15. us-west-1
16. us-west-2
AWS Region ID [us-east-1]:
```

Be sure to select a region containing the EC2 key pair you wish to use. You can also import a public key using these instructions.

During the process you will be asked to set up your networking environment. The wizard will offer you the choice of using an existing VPC or creating a new one on the fly.

Automate VPC creation? (y/n) [n]:

Enter n if you already have a VPC suitable for the cluster. Otherwise you can let pcluster configure create a VPC for you. The same choice is given for the subnet configuration: you can select a valid subnet ID for both the head node and compute nodes, or you can let pcluster configure set up everything for you. In the latter case, just select the configuration you prefer.

Automate Subnet creation? (y/n) [y]: y
Allowed values for Network Configuration:
1. Head node in a public subnet and compute fleet in a private subnet
2. Head node and compute fleet in the same public subnet

At the end of the process a message like this one will be shown:

Configuration file written to /dir/conf_file
You can edit your configuration file or simply run 'pcluster create-cluster --cluster-name cluster-name --cluster-configuration /dir/cluster-config.yaml' to create your cluster.

Now you can create your first cluster:

$ pcluster create-cluster --cluster-name myfirstcluster --cluster-configuration /dir/cluster-config.yaml
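If you want to drive this step from a script, one option is to assemble the same command as an argument list and hand it to `subprocess`. This is only a sketch: the helper name `create_cluster_command` is hypothetical, and it uses only the two flags shown in the command above.

```python
import shlex


def create_cluster_command(cluster_name, config_path):
    """Build the argument list for the `pcluster create-cluster` call shown above."""
    return [
        "pcluster", "create-cluster",
        "--cluster-name", cluster_name,
        "--cluster-configuration", config_path,
    ]


cmd = create_cluster_command("myfirstcluster", "/dir/cluster-config.yaml")
# Print the command in shell-quoted form for review before running it.
print(shlex.join(cmd))

# To actually launch the cluster (requires pcluster on PATH and AWS credentials):
#   import subprocess
#   subprocess.run(cmd, check=True)
```

Passing a list rather than a single string avoids shell quoting problems when the cluster name or configuration path contains spaces.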

After the cluster finishes creating, log in:

$ pcluster ssh --cluster-name myfirstcluster
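Logging in only works once creation has finished. A script can check for that by reading the cluster status, as in the sketch below. It rests on assumptions that do not come from this README: that `pcluster describe-cluster --cluster-name NAME` prints JSON and that the JSON carries a `clusterStatus` field with values such as `CREATE_IN_PROGRESS` and `CREATE_COMPLETE`. Verify both against the CLI reference for your version. The helper name `creation_finished` and the sample payload are hypothetical.

```python
import json


def creation_finished(describe_output):
    """Interpret JSON text assumed to come from `pcluster describe-cluster`.

    Returns True on CREATE_COMPLETE, False while creation is still in
    progress, and raises RuntimeError for any other status.
    """
    # ASSUMPTION: the status lives under the "clusterStatus" key.
    status = json.loads(describe_output).get("clusterStatus")
    if status == "CREATE_COMPLETE":
        return True
    if status == "CREATE_IN_PROGRESS":
        return False
    raise RuntimeError("unexpected cluster status: %r" % status)


# Illustrative payload only, not captured from a real cluster.
sample = '{"clusterName": "myfirstcluster", "clusterStatus": "CREATE_IN_PROGRESS"}'
print(creation_finished(sample))
```

A polling loop would call this on fresh command output every so often and run `pcluster ssh` once it returns True.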

You can view the running compute hosts:

$ sinfo
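To use the `sinfo` listing from a script, its text can be split into one record per row. The sketch below assumes the default Slurm column layout (PARTITION, AVAIL, TIMELIMIT, NODES, STATE, NODELIST) and that no field contains spaces. The sample text is made up for illustration, and `parse_sinfo` is a hypothetical helper name.

```python
def parse_sinfo(text):
    """Turn whitespace-separated sinfo output into a list of row dicts."""
    lines = [line.split() for line in text.strip().splitlines() if line.strip()]
    header, rows = lines[0], lines[1:]
    # Pair each header name with the value in the same column position.
    return [dict(zip(header, row)) for row in rows]


# Illustrative output only; the partition and node names are invented.
SAMPLE = """\
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
queue1*   up    infinite  10    idle~ queue1-dy-compute-[1-10]
"""

for row in parse_sinfo(SAMPLE):
    print(row["PARTITION"], row["STATE"], row["NODES"])
```

For anything beyond a quick check, a fixed output format requested from `sinfo` itself is more robust than relying on the default columns.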

For more information on any of these steps see the Getting Started Guide.

Documentation

We've been working hard to greatly improve the Documentation. It is now published in 10 languages, one of the many benefits of being hosted on AWS Docs. Of most interest to new users is the Getting Started Guide.

If you have changes you would like to see in the docs, please either submit feedback using the feedback link at the bottom of each page or create an issue or pull request for the project at: https://github.com/awsdocs/aws-parallelcluster-user-guide.

Issues


Please open a GitHub issue for any feedback or issues: https://github.com/aws/aws-parallelcluster/issues. There is also an active AWS HPC forum which may be helpful: https://repost.aws/tags/TAbl-DsTlyQMe0T2i-d5Rr8g/aws-parallel-cluster.

Changes

CfnCluster to AWS ParallelCluster

In Version 2.0.0, we changed the name of CfnCluster to AWS ParallelCluster. With that name change we released several new features, which you can read about in the Change Log.

Owner

  • Name: Amazon Web Services
  • Login: aws
  • Kind: organization
  • Email: open-source-github@amazon.com
  • Location: United States of America

GitHub Events

Total
  • Create event: 21
  • Release event: 4
  • Issues event: 106
  • Watch event: 43
  • Delete event: 4
  • Issue comment event: 314
  • Push event: 331
  • Gollum event: 31
  • Pull request review comment event: 428
  • Pull request review event: 763
  • Pull request event: 718
  • Fork event: 9
Last Year
  • Create event: 21
  • Release event: 4
  • Issues event: 106
  • Watch event: 43
  • Delete event: 4
  • Issue comment event: 314
  • Push event: 331
  • Gollum event: 31
  • Pull request review comment event: 428
  • Pull request review event: 763
  • Pull request event: 718
  • Fork event: 9

Committers

Last synced: about 1 year ago

All Time
  • Total Commits: 6,098
  • Total Committers: 119
  • Avg Commits per committer: 51.244
  • Development Distribution Score (DDS): 0.872
Past Year
  • Commits: 442
  • Committers: 13
  • Avg Commits per committer: 34.0
  • Development Distribution Score (DDS): 0.624
Top Committers
Name Email Commits
Francesco De Martino f****m@a****m 778
Enrico Usai u****i@a****m 747
Hanwen h****i@a****m 607
Luca Carrogu c****u@a****m 546
Giacomo Marciani m****o@a****m 428
chenwany c****y@a****m 247
Sean Smith s****m@a****m 245
Tim Lane t****e@a****m 207
Dougal Ballantyne d****b@a****m 158
Rex s****c@a****m 146
Yulei Wang y****n@a****m 137
ddeidda d****a@a****m 131
Ermanno Moser e****n@a****m 130
Jacopo De Amicis j****s@a****t 105
Himani Anil Deshpande 7****1 102
Nicola Sirena n****a@a****m 101
Charles c****a@a****m 94
Ryan Anderson n****y@a****m 89
hgreebe 1****e 83
Eddy Mwiti e****i@a****m 79
Francesco Giordano g****r@a****t 76
Dougal Ballantyne d****m 73
ParallelCluster AMI bot e****v@a****m 71
Xuanqi He 9****8 59
Balaji Sridharan f****j@a****m 47
Edoardo Antonini e****n@a****m 44
Delong Meng d****g@a****m 38
Maurizio Melato m****o@a****m 36
Himani Deshpande h****p@a****m 36
David Pratt 1****t 34
and 89 more...

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 321
  • Total pull requests: 1,986
  • Average time to close issues: 7 months
  • Average time to close pull requests: 9 days
  • Total issue authors: 163
  • Total pull request authors: 45
  • Average comments per issue: 2.95
  • Average comments per pull request: 0.45
  • Merged pull requests: 1,663
  • Bot issues: 0
  • Bot pull requests: 32
Past Year
  • Issues: 81
  • Pull requests: 799
  • Average time to close issues: 16 days
  • Average time to close pull requests: 2 days
  • Issue authors: 40
  • Pull request authors: 13
  • Average comments per issue: 1.14
  • Average comments per pull request: 0.11
  • Merged pull requests: 661
  • Bot issues: 0
  • Bot pull requests: 14
Top Authors
Issue Authors
  • gwolski (18)
  • snemir2 (11)
  • francisreyes-tfs (9)
  • gmarciani (8)
  • hanwen-pcluste (7)
  • stefan-maxar (7)
  • himani2411 (6)
  • nyetsche (6)
  • enrico-usai (6)
  • joehellmersNOAA (5)
  • jagga13 (5)
  • hehe7318 (5)
  • rmarable-flaretx (5)
  • Waqiah (5)
  • mrgum (5)
Pull Request Authors
  • gmarciani (459)
  • hanwen-pcluste (269)
  • himani2411 (210)
  • hgreebe (181)
  • hanwen-cluster (175)
  • hehe7318 (139)
  • dreambeyondorange (110)
  • enrico-usai (84)
  • jdeamicis (62)
  • NSsirena (49)
  • lukeseawalker (46)
  • EddyMM (36)
  • dependabot[bot] (19)
  • judysng (18)
  • github-actions[bot] (13)
Top Labels
Issue Labels
3.x (146) enhancement (66) bug (49) closing-soon-if-no-response (26) pending release (23) question (10) known issue (9) backlog (7) update (7) skip-changelog-update (7) waiting customer (4) help wanted (3) 2.x (3) duplicate (2) component:scheduler (1) component:cli (1) Security (1) Scheduler (1) skip-security-exclusions-check (1) Environment (1) docs (1)
Pull Request Labels
skip-changelog-update (1,512) 3.x (931) Test (197) bug (46) docs (43) dependencies (29) Environment (19) enhancement (17) skip-security-exclusions-check (13) Security (11) awsbatch (4) skip-bad-url-suffix-check (4) Code Style (4) Developer Experience (4) Frontend (3) Versioning (3) Observability (2) Compute Fleet (1) Scheduler (1) component:cli (1)

Packages

  • Total packages: 5
  • Total downloads:
    • pypi 97,255 last-month
  • Total dependent packages: 5
    (may contain duplicates)
  • Total dependent repositories: 16
    (may contain duplicates)
  • Total versions: 229
  • Total maintainers: 9
pypi.org: aws-parallelcluster

AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud.

  • Versions: 79
  • Dependent Packages: 5
  • Dependent Repositories: 15
  • Downloads: 95,104 Last month
Rankings
Downloads: 1.3%
Dependent packages count: 1.6%
Stargazers count: 2.3%
Average: 2.4%
Forks count: 2.9%
Dependent repos count: 3.7%
Maintainers (1)
Last synced: 10 months ago
proxy.golang.org: github.com/aws/aws-parallelcluster
  • Versions: 94
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 5.6%
Average: 5.8%
Dependent repos count: 5.9%
Last synced: 10 months ago
pypi.org: aws-parallelcluster-awsbatch-cli

AWS ParallelCluster AWS Batch CLI provides a set of commands to manage AWS Batch resources created by ParallelCluster and AWS Batch jobs.

  • Versions: 6
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 2,151 Last month
Rankings
Stargazers count: 2.3%
Forks count: 2.9%
Downloads: 5.4%
Average: 8.5%
Dependent packages count: 10.1%
Dependent repos count: 21.5%
Maintainers (1)
Last synced: 10 months ago
spack.io: aws-parallelcluster

AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud.

  • Versions: 24
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Forks count: 5.4%
Stargazers count: 8.5%
Average: 17.8%
Dependent packages count: 57.3%
Last synced: 10 months ago
conda-forge.org: aws-parallelcluster
  • Versions: 26
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Forks count: 9.2%
Stargazers count: 13.6%
Average: 27.0%
Dependent repos count: 34.0%
Dependent packages count: 51.2%
Last synced: 10 months ago

Dependencies

api/spec/smithy/build.gradle.kts maven
  • software.amazon.smithy:smithy-aws-apigateway-traits 1.22.0 implementation
  • software.amazon.smithy:smithy-aws-traits 1.22.0 implementation
  • software.amazon.smithy:smithy-linters 1.22.0 implementation
  • software.amazon.smithy:smithy-model 1.22.0 implementation
api/client/src/requirements.txt pypi
  • boto3 >=1.16.14
  • python_dateutil >=2.5.3
  • setuptools >=21.0.0
  • urllib3 >=1.26.6
api/client/src/test-requirements.txt pypi
  • pytest-cov >=2.8.1
awsbatch-cli/requirements.txt pypi
  • boto3 >=1.16.14
  • tabulate *
awsbatch-cli/tests/requirements.txt pypi
  • assertpy *
  • pytest *
  • pytest-cov *
  • pytest-datadir *
  • pytest-html *
  • pytest-mock *
  • pytest-xdist *
  • recordclass *
cli/requirements.txt pypi
  • PyYAML *
  • aws-cdk.aws-batch *
  • aws-cdk.aws-cloudformation *
  • aws-cdk.aws-codebuild *
  • aws-cdk.aws-dynamodb *
  • aws-cdk.aws-ec2 *
  • aws-cdk.aws-efs *
  • aws-cdk.aws-events *
  • aws-cdk.aws-fsx *
  • aws-cdk.aws-iam *
  • aws-cdk.aws-imagebuilder *
  • aws-cdk.aws-logs *
  • aws-cdk.aws-route53 *
  • aws-cdk.aws-sqs *
  • aws-cdk.aws-ssm *
  • aws-cdk.core *
  • aws_cdk.aws-cloudwatch *
  • aws_cdk.aws-lambda *
  • boto3 >=1.16.14
  • connexion *
  • flask *
  • jinja2 *
  • jmespath *
  • marshmallow *
  • tabulate *
  • werkzeug *
cli/tests/requirements.txt pypi
  • assertpy *
  • freezegun *
  • jinja2 *
  • munch *
  • pytest *
  • pytest-cov *
  • pytest-datadir *
  • pytest-html *
  • pytest-mock *
  • pytest-xdist *
  • recordclass *
cloudformation/tests/requirements.txt pypi
  • assertpy *
  • cfn-flip *
  • cfn-lint *
  • jinja2 *
  • pytest *
cloudformation/utils/requirements.txt pypi
  • cfn_flip *
scheduler_plugins/plugin_template/tests/requirements.txt pypi
  • cfn_flip *
  • jinja2 *
scheduler_plugins/slurm/tests/requirements.txt pypi
  • cfn-lint *
  • cfn_flip *
  • jinja2 *
tests/integration-tests/requirements.txt pypi
  • PyYAML *
  • argparse *
  • assertpy *
  • boto3 *
  • cfn_flip *
  • click *
  • fabric ==2.6.0
  • filelock *
  • jinja2 *
  • junitparser *
  • matplotlib *
  • pexpect *
  • psutil *
  • pyOpenSSL *
  • pykwalify *
  • pytest *
  • pytest-datadir *
  • pytest-html *
  • pytest-rerunfailures *
  • pytest-sugar *
  • pytest-xdist *
  • requests *
  • retrying *
  • troposphere *
  • untangle *
scheduler_plugins/slurm/artifacts/slurm_plugin_cookbook/third-party/line-4.0.1/Gemfile rubygems
  • danger >= 0
.github/workflows/changelog_enforcer.yml actions
  • actions/checkout v2 composite
  • dangoslen/changelog-enforcer v1.4.0 composite
.github/workflows/ci.yml actions
  • actions/checkout v2 composite
  • actions/setup-java v1 composite
  • actions/setup-python v2 composite
  • actions/upload-artifact master composite
  • dependency-check/Dependency-Check_Action main composite
  • ludeeus/action-shellcheck master composite
  • mikefarah/yq v4.6.3 composite
.github/workflows/closed-issue-message.yml actions
  • aws-actions/closed-issue-message v1 composite
.github/workflows/codeql-analysis.yml actions
  • actions/checkout v2 composite
  • github/codeql-action/analyze v1 composite
  • github/codeql-action/init v1 composite
.github/workflows/no-response.yml actions
  • lee-dohm/no-response v0.5.0 composite
.github/workflows/security_exclusions_checker.yml actions
  • francesco-giordano/gh-pr-content-checker v1.0.0 composite
api/docker/awslambda/Dockerfile docker
  • public.ecr.aws/lambda/python 3.9 build
cli/src/pcluster/resources/batch/docker/alinux2/Dockerfile docker
  • public.ecr.aws/amazonlinux/amazonlinux 2 build