https://github.com/hcdp/mesonet_station_parser
Code for parsing mesonet loggernet files into Tapis stream
Science Score: 8.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Academic email domains
-
✓Institutional organization owner
Organization hcdp has institutional domain (hawaii.edu) -
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (6.8%) to scientific vocabulary
Repository
Code for parsing mesonet loggernet files into Tapis stream
Basic Info
- Host: GitHub
- Owner: HCDP
- Language: Python
- Default Branch: main
- Size: 325 KB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
mesonetstationparser
This repository contains codes that parse mesonet station data (.dat) files into Tapis streams_api.
The repository contains 2 primary script files that achieve this. - streamsprocessor.py - This file parses incoming station data that is stored in a directory(datadir). - There are variations of this file that speed up the parsing process by using Python's multiprocessing module - legacy.py - This file downloads past data by sending API requests to the ikewai gateway, thus allowing us to parse past data into Tapis streams-api. - You are able to provide a startdate and an enddate range for what files you want to parse.
The standard_var folder contains csv files that maps the raw shortnames of the stations' data to the standardized shortnames - Note: the csv files are not extremely accurate as the raw shortnames for the stations' data changes occasionally.
TODO List
- [ ] Ensure that standardize_variable function is working properly (still waiting for HCDP folks to decide on a more permanent variable naming convention)
- [ ] Move username, password, datadir, (startdate and end_date for legacy.py) into a config file for easier editability
- [ ] Potentially add a field in the config file to enable parallelism and define number of workers
- [ ] For production use, remove iteration counter from projectid, siteid and inst_id
Owner
- Name: Hawaii Climate Data Portal
- Login: HCDP
- Kind: organization
- Website: https://hawaii.edu/hcdp
- Repositories: 1
- Profile: https://github.com/HCDP
GitHub Events
Total
- Delete event: 1
- Push event: 8
- Pull request event: 2
- Create event: 1
Last Year
- Delete event: 1
- Push event: 8
- Pull request event: 2
- Create event: 1