Recent Releases of segene
segene - v1.13.0 Release
[1.13.0] - 2025-07-23
Added
- New development version: SEgeneanalyzererna:
- eRNAbase-specific analysis tool derived from SEgeneregionpackage
- Command-line interface with 4 specialized commands:
report,single,batch,prepare-stats - pip-installable package with
erna-analyzersystem-wide command - Support for multiple BED files and Parquet metadata format specific to eRNAbase
- Species filtering functionality (human/mouse) for eRNAbase datasets
- Comprehensive bilingual documentation (English and Japanese)
- Enhanced chromosome filtering and validation for genomic region analysis
- WSL-compatible matplotlib backend configuration for headless environments
- Complete test suite with real eRNAbase data validation (858 samples)
- Statistical enrichment analysis with Fisher's exact test and FDR correction
- Multiple output formats: PNG, SVG, PDF, CSV, HTML reports
- Positioned as eRNAbase-specialized counterpart to SEgene_analyzer
Changed
- Documentation updates:
- Added SEgeneanalyzererna to "Development Versions" section in main README files
- Updated usage links to include the new development version documentation
- Python
Published by nswork168 11 months ago
segene - v1.12.0 Release
[1.12.0] - 2025-07-18
Added
- New development version: SEgene_analyzer:
- Development version of SEgeneregionanalyzer with enhanced CLI interface and modern Python packaging
- Command-line interface with 4 specialized commands:
prepare-stats,batch,single,report - pip-installable package with
sedb-analyzersystem-wide command - Advanced caching system for improved batch processing performance
- Comprehensive bilingual documentation (English and Japanese)
- Test suite with 29 test cases covering core functionality
- Enhanced matplotlib backend configuration for WSL environments
- Multiple output formats: PNG, SVG, PDF, EPS
- Positioned as experimental development version alongside stable SEgeneregionanalyzer
Changed
- Documentation structure updates:
- Added "Development Versions" section to main README files
- Updated Program Structure section to mention development versions availability
- Enhanced README files with bold emphasis on development program information
- Improved navigation with direct links to development version documentation
- Python
Published by nswork168 11 months ago
segene - v1.11.0 Release
[1.11.0] - 2025-05-27
Published
- 🎉 SEgene paper officially published:
- Shinkai, N., Asada, K., Machino, H., Takasawa, K., Takahashi, S., Kouno, N., Komatsu, M., Hamamoto, R., & Kaneko, S. (2025). SEgene identifies links between super enhancers and gene expression across cell types. npj Systems Biology and Applications, 11(1), 49. https://doi.org/10.1038/s41540-025-00533-x
- Test data and supplementary materials released on Figshare: https://doi.org/10.6084/m9.figshare.28171127
Changed
- Documentation updated following paper publication:
- Updated all README files to include the official paper citation
- Replaced all "manuscript in preparation" notices with published paper information
- Updated CITATION file with complete publication details and BibTeX format
- Standardized citation section titles in Japanese READMEs to "引用 / Citation"
- Updated Figshare links throughout documentation from placeholders to official DOI
Added
- Citation sections: Added missing citation sections to ensure all components include proper citation information
- Python
Published by nswork168 about 1 year ago
segene - v1.10.0 Release
[1.10.0] - 2025-05-02
Added
- Enhanced SEgene_peakprep with edgeR normalized CPM:
- Added edgeR normalized CPM (calcnorm CPM) as the third quantification method in addition to Standard log2-CPM and BigWig methods
- Implemented integration with R's edgeR package via rpy2 for advanced normalization of ChIP-seq count data
- Added multiple normalization methods from edgeR: upperquartile (default), TMM, RLE
- Implemented optional filtering capabilities based on CPM values across samples
- Added comprehensive documentation in both English and Japanese
- Created a dedicated edgeR normalized CPM guide (cpmcalcnormREADME.md)
Changed
- Updated SEgene_peakprep code and documentation:
- Reorganized CPM method to support both standard log2-CPM and edgeR normalized CPM calculations
- Updated command-line interface with new
--calcnormparameter and related options for normalization control - Improved sample name handling for better compatibility with complex file naming conventions
- Updated all relevant documentation (README.md, READMEja.md, cpmREADME.md, cpmREADMEja.md) to explain the new capabilities
- Dependencies:
- Added R (≥4.2.0), edgeR (≥3.40.0), and rpy2 (≥3.5.0) as new dependencies for the edgeR normalized CPM functionality
- Updated installation instructions to explain the setup of R and Bioconductor environments
- Python
Published by nswork168 about 1 year ago
segene - v1.9.0 Release
[1.9.0] - 2025-04-24
Added
- New component: SEgene_geneprep:
- Preparation tool for RNA-seq input data to add region information for P2GL input formats
- Extracts gene position information from GTF files and merges with expression data from TPM files
- Converts salmon output from nf-core/rnaseq pipeline to CSV format suitable for RNAinput in SEgene analysis
- Supports multiple input formats including standard GTF and salmon TPM files
- Compatible with genome annotations from Illumina iGenomes (e.g., GRCh38)
- Comprehensive documentation in both English and Japanese
- Workflow visualization:
- Added mermaid diagrams to visualize the complete SEgene workflow
- Clear illustration of the four-step process: P2GL Preparation, P2GL Analysis, SE Analysis, and Region Evaluation
Changed
- Program structure reorganization:
- Reorganized workflow into four primary components for clearer understanding
- Positioned SEgenepeakprep and SEgenegeneprep as the initial P2GL data preparation steps
- Updated documentation structure to reflect the new workflow organization
- Documentation updates:
- Comprehensive revision of main README files (both English and Japanese)
- Restructured program structure sections for better clarity
- Enhanced descriptions of component relationships and data flow
- Python
Published by nswork168 about 1 year ago
segene - v1.8.0 Release
[1.8.0] - 2025-04-17
Added
- Enhanced SEgene_peakprep with differential analysis:
- Added differential analysis functionality for ChIP-seq versus Input control data within the bigWig processing workflow.
- This feature utilizes
bamComparefrom thedeeptoolssuite to identify significantly enriched regions by comparing ChIP signals against Input signals.
Changed
- Updated SEgene_peakprep code and documentation:
- Updated relevant scripts within the
SEgene_peakprepdirectory (includingbigwig_peakprep_bamdiff.py,bigwig_peakprep_bamdiff_utils.py,bigwig_peakprep.py, etc.) to integrate the new differential analysis capability. - Correspondingly revised associated documentation (
README.md,README_ja.md,bigwig_README.md,bigwig_README_ja.md) to accurately reflect the updated features and their usage instructions.
- Updated relevant scripts within the
- Python
Published by nswork168 about 1 year ago
segene - v1.7.0 Release
[1.7.0] - 2025-04-14
Added
- New BigWig method in SEgene_peakprep:
- Implemented a new alternative approach for peak quantification using BigWig files
- Added
deeptoolsintegration for BAM to BigWig conversion and signal quantification - Added
bamCoveragefunctionality with multiple normalization options (RPGC, CPM, BPM, RPKM) - Added
multiBigwigSummaryprocessing for extracting signal values from specified regions - Implemented automatic log2 transformation of signal values
- Created comprehensive utilities for processing and handling BigWig files and genomic coordinates
- Enhanced processing workflow:
- Added three-stage workflow with separate scripts for different processing steps
- Added wrapper script for seamless execution of the entire pipeline
- Added option to run individual processing steps separately
Changed
- Expanded SEgene_peakprep capabilities:
- Updated documentation to reflect both CPM and BigWig implementation methods
- Expanded command-line interface to support both processing approaches
- Standardized output formats across both methods for downstream compatibility
Fixed
- Corrected terminology in SEgene_peakprep:
- Fixed references to "mergeSV.tsv" format from v1.6.1, which should correctly be "mergeSE.tsv" (Super-Enhancer) format
- Updated all related documentation and code comments to use consistent terminology
- Fixed
--is_mergesv_formatflag name to--is_mergese_formatfor specifying merge_SE.tsv format annotations
- Python
Published by nswork168 about 1 year ago
segene - v1.6.1 Release
[1.6.1] - 2025-04-11
Added
- SEgene_peakprep improvements:
- Added direct support for BED and merge_SV.tsv input formats in annotation files
- Automatic conversion of BED and merge_SV formats to SAF format internally
- Added
--is_mergesv_formatflag for specifying merge_SV.tsv format annotations
Changed
- SEgene_peakprep workflow optimization:
- Streamlined command-line interface with more intuitive parameter names
- Adjusted log level output for better readability during execution
- Improved temporary file handling
- Documentation updates:
- Comprehensive revision of README files (both English and Japanese)
- Added clearer examples for different annotation file formats
- Updated command examples to reflect the new parameter structure
- Improved troubleshooting section
- Python
Published by nswork168 about 1 year ago
segene - v1.6.0 Release
[1.6.0] - 2025-04-06
Added
- New component: SEgene_peakprep:
- New data preprocessing pipeline for ChIP-seq data normalization
- Integrated processing of peak region information from multiple BAM files into normalized data tables
- Calculation of total mapped reads using
samtools flagstat - Read counting in genomic regions using
featureCounts - CPM calculation and log transformation functionality
- Integration with existing workflow components
- Documentation:
- Documentation for SEgene_peakprep
- Revised workflow description positioning SEgene_peakprep as the initial data preparation step
- Python
Published by nswork168 about 1 year ago
segene - v1.5.0 Release
[1.5.0] - 2025-04-05
Added
- New component: SEgene_RegionAnalyzer:
- Added a new analytical tool for evaluating super-enhancer activity in genomic regions of interest
- Integrated with public databases (currently SEdb 2.0)
- Performs enrichment analysis of tissue-specific super-enhancer associations
- Supports both SEgene output TSV files and standard BED format inputs
- Generates comprehensive TSV reports and visual HTML outputs
- Documentation:
- Added detailed documentation for SEgene_RegionAnalyzer
- Updated main README to include the new component in the project structure
Changed
- Reorganized the project structure to include SEgene_RegionAnalyzer as an optional component
- Updated the workflow description to reflect the extended analytical capabilities
- Python
Published by nswork168 about 1 year ago
segene - v1.4.0 Release
[1.4.0] - 2025-03-19
Added
- Enhanced network visualization capabilities:
- Added detailed graph drawing methods (
draw_network_detailed,draw_subnetwork_detailed,draw_two_layer_subnetwork_detailed) - Added support for multiple output formats (SVG, PNG, EPS, PDF) across all visualization functions
- Added customization options for node styling, edge styling, and subgraph boundaries
- Added detailed graph drawing methods (
- Improved data export functionality:
- Added ability to save graph data alongside visualizations
- Added logging capabilities for reproducibility
- Added support for exporting data in various formats (CSV, TSV, JSON)
- Enhanced visualization parameters:
- Added resolution (DPI) control for raster format outputs
- Added figure size and styling customization options
- Added custom title support for all network visualization methods (
draw_network,draw_subnetwork,draw_two_layer_subnetwork)
Changed
- Updated existing visualization methods to support expanded format options
- Extended graph rendering interfaces with additional customization parameters
- Improved file naming and organization for saved outputs
Notes
- These visualization enhancements are designed to maintain compatibility with existing network analysis functions
Full Changelog: https://github.com/hamamoto-lab/SEgene/compare/v1.3.0...v1.4.0
- Python
Published by nswork168 about 1 year ago
segene - v1.3.0 Release
[1.3.0] - 2025-03-15
Added
- Enhanced visualization options for ROSE summary plots:
- Added customizable parameters for DPI, figure size, and output format
- Improved figure saving capabilities with format selection
Fixed
- Fixed threshold comparison in
p2gl_path_to_filter_dffunction to correct filtering behavior
Notes
- Added pronunciation guide for SEgene ("S-E-gene") in documentation
Full Changelog: https://github.com/hamamoto-lab/SEgene/compare/v1.2.0...v1.3.0
- Python
Published by nswork168 over 1 year ago
segene - v1.2.0 Release
[1.2.0] - 2025-03-11
Added
- New core analysis capabilities for SE rank distribution:
- Added functionality to search for gene-linked super-enhancers across multiple samples
- Implemented super-enhancer ranking analysis within datasets
- Developed statistical tools for percentile calculation and distribution analysis
- Created visualization methods for SE rank distributions across samples
- New tutorial notebooks to demonstrate the new analysis features:
tutorial_book_SE_rank_disribution.ipynb(English)tutorial_book_SE_rank_disribution_ja.ipynb(Japanese)- These notebooks guide users through analyzing the ranking status (position and percentile) of super-enhancers corresponding to specific genes
Notes
- Documentation in README files has been updated to include information about the new notebooks
- Python
Published by nswork168 over 1 year ago
segene - v1.1.0 Release
[1.1.0] - 2025-02-07
Added
- New CLI tools (
cli_tools/) for streamlined featureCounts processing, CPM calculation, and SE–gene correlation analysis- Scripts such as
generate_gtf.py,generate_file_list.sh,run_featurecounts_array.sh, etc. correlation_analysis.pycomputes Pearson correlation between SE (CPM) and genes (TPM)
- Scripts such as
- Documentation updates:
- Added English (
README.md) and Japanese (README_ja.md) documentation under/cli_tools/ - Updated the main repository documentation (
README.md) and/SE_to_gene_links/docs to explain how to integrate the new CLI tools
- Added English (
Changed
- Refined repository structure by separating stand-alone CLI scripts into
cli_tools/, independent ofSEgene_package
Notes
- The newly added documents provide usage instructions and workflow details in both English and Japanese
- Enables advanced or extended analyses on top of the existing
SE_to_gene_linksfunctionalities
- Python
Published by nswork168 over 1 year ago
segene - v1.0.0 Initial Release
v1.0.0 Release Notes
We are excited to announce the initial release of SEgene, a comprehensive platform for identifying and analyzing Super-Enhancer-to-gene links through statistical approaches. This release marks the foundation of a toolset that will enable researchers to explore gene regulation with greater insight.
Detailed Functionality
peaktogene_links
- Retrieves correlation information between gene expression and enhancer peaks.
- Supports data analysis and visualization.
SEtogene_links
- Evaluates and analyzes super-enhancer to gene links using correlation data from
peak_to_gene_links. - Utilizes graph theory for data visualization and interactive analysis with Jupyter Notebook.
- Evaluates and analyzes super-enhancer to gene links using correlation data from
For detailed installation instructions and documentation, please visit our GitHub repository.
Full Changelog: https://github.com/hamamoto-lab/SEgene/commits/v1.0.0
- Python
Published by nswork168 over 1 year ago