apread

Python package for reading catmanAP binary files.

https://github.com/leonbohmann/apreader

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.7%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Python package for reading catmanAP binary files.

Basic Info
  • Host: GitHub
  • Owner: leonbohmann
  • License: mit
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 22.7 MB
Statistics
  • Stars: 12
  • Watchers: 2
  • Forks: 7
  • Open Issues: 0
  • Releases: 30
Created almost 5 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

README.md

apread

previously: Catman Reader

PyPi Upload pyPI - Version PyPI - Downloads Downloads

Support this project

Buy Me A Coffee

Cite this project

If you use this software in any of your work, please cite it using the "Cite this repository" button in the right sidebar or use this:

@software{leonbohmann_APReader, author = {Bohmann, Leon}, doi = {10.5281/zenodo.8369804}, month = sep, title = {{leonbohmann/APReader: v1.1.2}}, url = {https://github.com/leonbohmann/APReader}, version = {v1.1.2}, year = {2023} }

General

Read binary files produced from catmanAP projects directly into python.

CatmanAP procudes .bin files after each measurement. While it is possible to export as a different format (i.e. txt or asc) it's not efficient because one has to change the export format after every measurement. Here comes the treat: Just export as binary and use this package to work with binary files directly.

After reading all channels from the binary file, the channels are analyzed and every measure-channel will receive a reference to a time channel, depending on the amount of entries in the channels and the fact, that the time-channel has to contain "time" or "zeit" in its name. What that means is, that a channel with x entries and the name "time - 1" will be regarded as the time-channel of any other channel with x Data Entries.

Here is an example plot, generated directly from a binary file: apread_demo_out_1

Installation/Update

Anywhere with python (note the uppercase U):

sh pip install -U apread

How it works

The workflow of the package is straight-forward. You supply a binary file created with CatmanAP and the script will read that into python.

First of all, the binary data is analyzed and packaged into seperate Channel objects. When all Channels are created, each Channel.Name will be checked against ([T|t]ime)|([Z|z]eit), which mark time channels which usually are the reference.

These Channels marked as istime are the basis for Groups. Inside a group you will find the ChannelX (time channel) and a bunch of other channels in ChannelsY, which are the channels containing data in that time domain. The corresponding channels inside a Group are found by analyzing their length. Since the total time measured is the same for all groups, it is assumed that Channels with the same data-length belong to the same group. Connecting the matching channels to the group give a structured representation of your measurement data.

Now that the Data is available in python you are free to do with that whatever you want. Until Version 1.0.x there were some features in which you can save the data but that feature has been removed.

Usage

Lets say you produced a file called measurements.bin and you put it in the directory of your python script, then you can create the APReader on that file. It's that simple. The Initialization may take some time depending on how large your .bin-File is.

```python from apread import APReader

reader = APReader('measurements.bin') # this will read in the file ```

Print channels

Afterwards you can access the Channels by accessing the APReader.Channels Member. Channel and Group implement __str__ which will return the name and the length of data inside it.

```python for channel in reader.Channels: print(channel)

"Timechannel 1 - Standard" (120341 Entries)

"T12_ref" (120341 Entries)

"T33" (120341 Entries)

"Timechannel 1 - Quick" (3022344 Entries)

"F1" (3022344 Entries)

"ast089" (3022344 Entries)

```

```python for group in reader.Groups: print(group)

"Timechannel 1 - Standard" (2 Data-channels, 120341 Entries)

"Timechannel 1 - Quick" (2 Data-channels, 3022344 Entries)

```

Plot Channels/Groups

To review your data on the fly, you can plot every entity in the data structure by calling .plot(). When plotting, every group will get its own figure window, in which all connected channels are plotted.

```python

plot the readers data

reader.plot()

plot all groups

for group in reader.Groups: group.plot()

plot all channels

for channel in reader.Channels: channel.plot() ```

As you can see, you can access the channels from the reader, which contains all channels (including time channels) or you can access them from the groups.

There are some more functions to plot specific data. When plotting multiple channels each channel gets its own y-axis.

python group.plotChannel(0) # specific channel group.plotChannels(0,3) # channel 1 to 3 (1,2,3) group.plot([0, 2, 4]) # channel 1, 3 and 5

The same can be applied to the APReader. The only difference is that you can plot specific groups instead of channels.

python reader.plotGroup(0) # specific group reader.plotGroups(0,3) # group 1 to 3 (1,2,3) reader.plot([0, 2, 4]) # group 1, 3 and 5

External Header

Thanks to (hakonbars PR13) you are now able to access external header information using channel.exthdr, a dicitionary containing all keys as described in this sheet.

python ['T0'] # ACQ timestamp info (NOW format) ['dt'] # ACQ delta t in ms ['SensorType'] # IDS code of sensor type ['SupplyVoltage'] # IDS code supply voltage ['FiltChar'] # IDS code of filter characteristics ['FiltFreq'] # IDS code of filter frequency ['TareVal'] # Current value in tare buffer ['ZeroVal'] # Current value in zero adjustment buffer ['MeasRange'] # IDS code of measuring range ['InChar'] # Input characteristics (0=x1,1=y1,2=x2,3=y2) ['SerNo'] # Amplifier serial number ['PhysUnit'] # Physical unit (if user scaling in effect, this is the user unit!) ['NativeUnit'] # Native unit ['Slot'] # Hardware slot number ['SubSlot'] # Sub-channel, 0 if single channel slot ['AmpType'] # IDS code of amplifier type ['APType'] # IDS code of AP connector type (MGCplus only) ['kFactor'] # Gage factor used in strain gage measurements ['bFactor'] # Bridge factor used in strain gage measurements ['MeasSig'] # IDS code of measurement signal (e.g. GROSS, NET) (MGCplus only) ['AmpInput'] # IDS code of amplifier input (ZERO,CAL,MEAS) ['HPFilt'] # IDS code of highpass filter ['OLImportInfo'] # Special information used in online export file headers ['ScaleType'] # 0=Engineering units, 1=Electrical ['SoftwareTareVal'] # Software tare (zero) for channels carrying a user scale ['WriteProtected'] # If true, write access is denied ['NominalRange'] # CAV value ['CLCFactor'] # Cable length compensation factor (CANHEAD only) ['ExportFormat'] # 0=8-Byte Double, 1=4-Byte Single, 2=2-Byte Integer (FOR CATMAN BINARY EXPORT ONLY!)

Parallel reading of data

Only available from version v1.1.1-alpha1 and above

See test/testing.py for a full example. Modify the following around your APReader call:

```python import multiprocessing as mp ...

if name == 'main': # this line has to be included! # without 'processes=...'! pool = mp.Pool()

# pass the pool to the reader
reader = APReader(file, parallelPool=pool)

# make sure to close the pool after you are done with it
mp.close()
mp.join()

```

For the parallel loading to work, you have to define a parallel pool of processes in your top-level script. These processes will be accessed from within APReader-Functions. When passing no arguments to mp.Pool() it will automatically create as many processes as possible, according to the amount of threads your CPU allows (cores + virtual cores). It does not make sense to pass in more, since the APReader spawns the same amount of processes as there are CPU Threads. Increasing the amount of processes in your pool does not increase the amount of parallelism. It is fixed.

Keep in mind, that parallelisation is not always faster. Spawning of processes is expensive and can be wasteful for small files.

The results from APReader stay the same and you can continue your analysis.

Release History

Version 1.1.1-alpha1

  • Added converted timestamp property on channels (Channel.date)
    • Property Channel.time will be deleted at some point in the future...
  • Parallel reading of binary files
    • Max degree of parallelism is automatically set to amount of available cores
  • ----------------------------
  • ----------------------------

Version 1.1

Breaking changes
  • Removed saving functions, this will be up to the user > Since these function change a lot based on current needs, I decided to remove the post-processing functionality completely. The user now needs to do the post-processing on his own, meaning the creation of plots using time and data channels...
Changes
  • (hakonbar PR13) Differentiate floating point precision
  • (hakonbar PR13) Reading additional header information
  • (hakonbar PR13) Supplying binary format reference
  • Fixed null returning string conversion function
  • Using regex to find time channels
  • Improved plotting with multiple axes
  • Printing channels and groups will now give a summary instead of all data

Version 1.0.22

  • Fixed an issue with groups where time channels are not recognized
  • now, user is prompted, when suspected time channel is found
  • plotting is not possible when there is no time-channel found
  • save groups and channels even when there is no time channel

Version 1.0.21

  • Updated serialisation-procedures to always encode in UTF-8

Version 1.0.20

  • Switched to explicit type hinting with typing package (compatibility issues with python <3.9.x)

Version 1.0.15/16

  • Fixed an issue with saving and non-existent directories
  • Added getas to generate formatted string without saving

Version 1.0.14

  • Output file-names updated

Version 1.0.12/13

  • Group channels with their time-channel into "groups"
  • Multiple plot modes:
    • Whole file
    • Channel/Group only
  • Output data
    • json
    • csv

Version 1.0.11

  • Progressbars indicate read-progress of files
  • Multiple plot modes

Version 1.0.0

  • Convert catman files to channels

Meta

Leon Bohmann – mail@leonbohmann.de

Distributed under the MIT license. See LICENSE for more information.

This software comes with no warranty, expressed or implied. Use at your own risk!

https://github.com/leonbohmann/apreader

Owner

  • Name: Leon Bohmann
  • Login: leonbohmann
  • Kind: user
  • Location: Darmstadt, Germany
  • Company: TU Darmstadt

Programming before being able to walk. Civil Engineer at TU Darmstadt.

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: "Bohmann"
    given-names: "Leon"
doi: 10.5281/zenodo.8369804
title: "leonbohmann/APReader: v1.1.2"
version: v1.1.2
date-released: 2023-09-22
url: "https://github.com/leonbohmann/APReader"

GitHub Events

Total
  • Watch event: 3
  • Issue comment event: 5
  • Push event: 1
  • Pull request event: 3
  • Pull request review event: 1
  • Fork event: 1
Last Year
  • Watch event: 3
  • Issue comment event: 5
  • Push event: 1
  • Pull request event: 3
  • Pull request review event: 1
  • Fork event: 1

Committers

Last synced: almost 3 years ago

All Time
  • Total Commits: 111
  • Total Committers: 3
  • Avg Commits per committer: 37.0
  • Development Distribution Score (DDS): 0.09
Top Committers
Name Email Commits
Leon Bohmann l****n@m****t 101
Håkon Line h****e@s****m 7
Leon Bohmann m****l@l****e 3
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 7
  • Total pull requests: 17
  • Average time to close issues: 14 days
  • Average time to close pull requests: 12 days
  • Total issue authors: 6
  • Total pull request authors: 6
  • Average comments per issue: 4.57
  • Average comments per pull request: 1.24
  • Merged pull requests: 14
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 3
  • Average time to close issues: N/A
  • Average time to close pull requests: 18 days
  • Issue authors: 0
  • Pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 2.67
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • MaximilianailimixaM (2)
  • hakonbar (1)
  • PTR13 (1)
  • Yurnerosk (1)
  • JanKovis (1)
  • leonbohmann (1)
  • airfoxa380 (1)
Pull Request Authors
  • leonbohmann (11)
  • Cangarw (2)
  • J-ECLEMENT (2)
  • pablo-benito (1)
  • hakonbar (1)
  • mca-proto (1)
Top Labels
Issue Labels
bug (1) will be fixed (1)
Pull Request Labels
enhancement (2) will be fixed (1)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 1,097 last-month
  • Total dependent packages: 1
  • Total dependent repositories: 0
  • Total versions: 40
  • Total maintainers: 1
pypi.org: apread

Import data from CatmanAP binary files.

  • Versions: 40
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Downloads: 1,097 Last month
Rankings
Dependent packages count: 4.7%
Downloads: 7.5%
Forks count: 16.8%
Stargazers count: 17.7%
Average: 22.8%
Dependent repos count: 67.4%
Maintainers (1)
Last synced: 7 months ago

Dependencies

setup.py pypi
  • matplotlib *
  • pandas *
  • plotly *
  • scipy *
  • tqdm *
  • typing *