Commit 4b68551

Update tables in paper.md
1 parent 0821cde commit 4b68551

1 file changed: +10 -1 lines changed

paper.md

+10-1
@@ -26,7 +26,7 @@ affiliations:
 - name: University of Illinois at Chicago
 index: 2
 
-date: XYZ February 2025
+date: 20 February 2025
 
 bibliography: paper.bib
 ---
@@ -52,11 +52,17 @@ NEAT can integrate seamlessly with existing bioinformatics workflows, providing
 | # | Feature Name | Prior Implementation (v2.0) | Updated Implementation (v4.X) |
 |---|-------------|------------------------------|--------------------------------|
 | 1 | **BAM File Generation** | File generation was tightly integrated with all NEAT processes | BAM creation was isolated from core functions |
+|---|-------------|------------------------------|--------------------------------|
 | 2 | **GC Bias Computation** | Used a custom script for GC bias calculation | Feature deprecated |
+|---|-------------|------------------------------|--------------------------------|
 | 3 | **Ploidy Simulation** | Limited to diploid organisms in practice | Supports unbounded ploidy levels |
+|---|-------------|------------------------------|--------------------------------|
 | 4 | **Read Generation** | The sliding-window approach to generate reads resulted in artificial gaps in sequencing reads (~50 base pairs) | A new form of coordinate-based read selection eliminates these gaps |
+|---|-------------|------------------------------|--------------------------------|
 | 5 | **Read Quality Modeling** | Markov-based model | Binning method with an option to also implement a revised Markov-based model |
+|---|-------------|------------------------------|--------------------------------|
 | 6 | **Variant Insertion** | Issues with inserted variants (loss of genotype data, prevented certain valid variants from insertion) | Preserves genotype data in the final variant call format (VCF) file |
+|---|-------------|------------------------------|--------------------------------|
 | 7 | **Variant Handling** | The code structure limited the introduction of new variant types | A modular design supports generic variant handling and the separation of insertions and deletions |
 
 The prior implementation of **1** tightly integrated BAM creation with all NEAT functions, leading to inefficiencies. The new update isolates BAM creation, allowing it to be toggled independently, improving runtime and modularity. **2** was removed due to redundancy, as advancements in sequencing technology rendered the custom script unnecessary. Its removal reduced runtime while eliminating associated bugs. **3** has been extended to allow accurate simulation of tumor genomes and polyploid organisms (e.g., plants), with inputs of ploidy greater than two and fractional ploidies now correctly simulating reads. **4** previously introduced artificial read gaps (~50 base pairs) due to a sliding-window approach. The updated coordinate-based selection eliminates these gaps, yielding a dataset that more accurately reflects real sequencing patterns. **5** initially did not achieve a tapering effect on a simulated read's edges. By incorporating a revised Markov model alongside the binning method, the tapering effect was successfully implemented. **6** suffered from loss of genotype data and an arbitrary restriction on certain valid variants. The updated version preserves genotype data in the final VCF file, improving accuracy and giving users greater control over insertions. **7** has been modularized to support structural and copy number variants, increasing flexibility and ensuring future extensibility for handling more complex variants.
@@ -70,8 +76,11 @@ The prior implementation of **1** tightly integrated BAM creation with all NEAT
 | # | Feature Name | Prior Implementation (v2.0) | Updated Implementation (v4.X) |
 |---|-------------|------------------------------|--------------------------------|
 | 1 | **Automated Testing** | No formal testing framework | Implemented continuous integration with GitHub-based automated tests |
+|---|-------------|------------------------------|--------------------------------|
 | 2 | **Refactored Unit Testing** | Monolithic, unstructured codebase | Rewritten with testable, discrete functions |
+|---|-------------|------------------------------|--------------------------------|
 | 3 | **Friendly Installation** | Not installable as a package | Fully modular and pip-installable via Poetry |
+|---|-------------|------------------------------|--------------------------------|
 | 4 | **Configuration Files** | Required explicit command-line flags | Introduced structured configuration files |
 
 **1** was implemented to address the lack of a formal testing structure. The new continuous integration (CI) pipeline detects bugs early, streamlining development and enhancing error detection (e.g., handling of BED files and other inputs). **2** improved debugging and maintenance by transitioning from a monolithic structure to a modular approach with testable, discrete functions, enhancing code integrity and collaboration. **3** was introduced to address the previous lack of package installation support, making NEAT 4.X modular and pip-installable via Poetry, which enhances portability and development ease. Lastly, **4** improved usability, debugging, and reproducibility by replacing cumbersome command-line flags with structured configuration files.
