Troubleshooting

Common issues and solutions when using qpx.

Installation Issues

ModuleNotFoundError: No module named 'qpx'

Problem: Python cannot find the qpx package after installation.

Solutions:

  1. Ensure you're using the correct Python environment:

    which python
    which qpxc

  2. Reinstall in the active environment:

    pip install --force-reinstall qpx

  3. If using conda, ensure the environment is activated:

    conda activate qpx
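If the shell checks look right but the import still fails, a quick standard-library check from inside Python confirms which interpreter is actually running and whether it can see qpx. The `module_location` helper below is ours for illustration, not part of qpx:

```python
import importlib.util
import sys

def module_location(name):
    """Return the file the active interpreter would load `name` from, or None."""
    spec = importlib.util.find_spec(name)
    return getattr(spec, "origin", None) if spec else None

print(sys.executable)          # the interpreter running this script
print(module_location("qpx"))  # None means this environment cannot import qpx
```

If `module_location("qpx")` prints None, the package is installed into a different environment than the one running your script.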

Python version incompatibility

Problem: Installation fails with Python version errors.

Solution: qpx requires Python 3.10 or higher. Check your version:

python --version

If needed, install a compatible Python version:

# Using conda
conda create -n qpx python=3.10
conda activate qpx
pip install qpx
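The same version check can be scripted; this snippet only reports compatibility, it does not install anything:

```python
import sys

REQUIRED = (3, 10)  # qpx requires Python 3.10 or higher
ok = sys.version_info >= REQUIRED
print(f"Python {sys.version.split()[0]} -> {'OK' if ok else 'too old for qpx'}")
```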

Missing dependencies

Problem: Import errors for packages like venn, pyopenms, or anndata.

Solution: Install the missing package:

pip install venn pyopenms anndata

Conversion Issues

File not found errors

Problem: FileNotFoundError when running convert commands.

Solutions:

  1. Use absolute paths:

    qpxc convert maxquant \
        --msms-file /full/path/to/msms.txt \
        --output-folder /full/path/to/output

  2. Verify the file exists:

    ls -la path/to/file.txt
    
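From Python, pathlib can resolve a relative path to the absolute form the converter will actually receive and confirm the file exists before you run qpxc. `check_input` is an illustrative helper, and the temporary file stands in for a real msms.txt:

```python
import tempfile
from pathlib import Path

def check_input(path):
    """Resolve `path` and report whether it points at a regular file."""
    p = Path(path).resolve()
    return p, p.is_file()

# demo with a real temporary file standing in for msms.txt
with tempfile.NamedTemporaryFile(suffix=".txt", delete=False) as fh:
    tmp = fh.name

resolved, exists = check_input(tmp)
print(resolved, exists)                    # absolute path, True
print(check_input("missing/msms.txt")[1])  # False -> qpxc will fail the same way
```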

Memory errors with large files

Problem: MemoryError or system becomes unresponsive with large datasets.

Solutions:

  1. Process files in batches if supported by the command

  2. Increase available memory or use a machine with more RAM

  3. For DIA-NN reports, use the --qvalue-threshold option to filter data:

    qpxc convert diann \
        --report-path report.tsv \
        --sdrf-path metadata.sdrf.tsv \
        --mzml-info-folder ./mzml_info \
        --qvalue-threshold 0.01 \
        --output-folder ./output
    

  4. For MaxQuant, use the --batch-size and --memory-limit options:

    qpxc convert maxquant \
        --msms-file msms.txt \
        --output-folder ./output \
        --batch-size 50000 \
        --memory-limit 8
    
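Where a command has no batching option, the same idea can be applied before conversion by splitting a large tab-separated input into smaller files. `split_tsv` below is a hypothetical pre-processing helper (not a qpx command) that repeats the header in every chunk so each part remains a valid TSV:

```python
import itertools
import tempfile
from pathlib import Path

def split_tsv(path, rows_per_chunk=50000):
    """Split a TSV into <name>.part0.tsv, <name>.part1.tsv, ... repeating the header."""
    path = Path(path)
    parts = []
    with open(path, encoding="utf-8") as fh:
        header = fh.readline()
        for i in itertools.count():
            rows = list(itertools.islice(fh, rows_per_chunk))
            if not rows:
                break
            part = path.with_suffix(f".part{i}.tsv")
            part.write_text(header + "".join(rows), encoding="utf-8")
            parts.append(part)
    return parts

# demo: five data rows split into chunks of two
demo = Path(tempfile.mkdtemp()) / "msms.txt"
demo.write_text("a\tb\n" + "".join(f"{i}\tx\n" for i in range(5)), encoding="utf-8")
parts = split_tsv(demo, rows_per_chunk=2)
print([p.name for p in parts])
```

Each part can then be converted separately and the resulting Parquet files combined downstream.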

Invalid file format errors

Problem: ValueError or parsing errors when reading input files.

Solutions:

  1. Verify the file format matches the expected format for the converter

  2. Check for file corruption:

    head -20 input_file.txt

  3. Ensure the file encoding is UTF-8:

    file input_file.txt
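From Python, attempting a strict UTF-8 decode pinpoints the first offending byte. `find_encoding_error` is an illustrative helper, not part of qpx:

```python
import tempfile

def find_encoding_error(path):
    """Return None if `path` decodes as UTF-8, else the offset of the first bad byte."""
    with open(path, "rb") as fh:
        data = fh.read()
    try:
        data.decode("utf-8")
        return None
    except UnicodeDecodeError as err:
        return err.start

# demo: a lone Latin-1 byte (0xE9, "é") is not valid UTF-8
with tempfile.NamedTemporaryFile(delete=False) as fh:
    fh.write(b"prot\xe9ine\n")
    bad = fh.name

print(find_encoding_error(bad))   # byte offset to inspect in the input file
```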

Output Issues

Empty output files

Problem: Parquet files are created but contain no data.

Solutions:

  1. Check if input data passes quality filters (q-value, PEP thresholds)

  2. Verify column names match expected format for the software

  3. Use the --verbose flag to see processing details:

    qpxc convert maxquant \
        --msms-file msms.txt \
        --output-folder ./ \
        --verbose
    
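When the output is empty, it helps to confirm how many input rows would survive the quality filter at all. This sketch uses made-up rows and the same 0.01 threshold as the DIA-NN example; the actual column names depend on your input format:

```python
rows = [
    {"peptide": "PEPTIDEK", "qvalue": 0.001},
    {"peptide": "SAMPLER",  "qvalue": 0.200},
    {"peptide": "LIVERK",   "qvalue": 0.009},
]

THRESHOLD = 0.01  # same cut-off a converter's q-value filter would apply
kept = [r for r in rows if r["qvalue"] <= THRESHOLD]
print(f"{len(kept)}/{len(rows)} rows survive the {THRESHOLD} q-value filter")
```

If very few rows survive, an empty Parquet file is expected rather than a bug.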

Missing columns in output

Problem: Expected columns are not present in the output Parquet file.

Solutions:

  1. Check if the input file contains the required source columns

  2. For spectral data, ensure --spectral-data flag is used:

    qpxc convert maxquant \
        --msms-file msms.txt \
        --output-folder ./ \
        --spectral-data

  3. Review the Format Specification for required vs optional fields
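A quick way to check for missing source columns before converting is a set difference against the input file's header. The required set below is illustrative only; consult the Format Specification for the real list:

```python
import csv
import io

REQUIRED = {"Sequence", "Charge", "Raw file"}   # illustrative, not the actual schema

def missing_columns(tsv_text):
    """Return required columns absent from the first (header) row of a TSV."""
    header = next(csv.reader(io.StringIO(tsv_text), delimiter="\t"))
    return sorted(REQUIRED - set(header))

sample = "Sequence\tCharge\tScore\n"
print(missing_columns(sample))   # ['Raw file']
```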

Validation Issues

Validating a dataset

Use the validate command to check a dataset against the canonical schemas:

# Validate all structures in a dataset
qpxc validate --dataset-path ./PXD014414

# Validate a specific structure
qpxc validate --dataset-path ./PXD014414 --structure feature

# Validate a single Parquet file
qpxc validate --file ./data.feature.parquet

Common validation errors

Missing required column: A required column is absent from the Parquet file. Check the schema reference for required fields.

Type mismatch: A column has a different Arrow type than the schema expects. This usually means the data was written with an older version of qpx or a different tool.

Null values in non-nullable columns: Required columns should not contain null values. Check your input data and conversion pipeline.

Duplicate primary key: Rows with identical primary key values exist. This may indicate duplicate entries in the source data.
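Duplicate keys can be located before validation with a Counter over the candidate key columns. The key tuple here (run, scan, peptide) is illustrative; use the primary key your structure's schema defines:

```python
from collections import Counter

rows = [
    ("run1", 1001, "PEPTIDEK"),
    ("run1", 1002, "SAMPLER"),
    ("run1", 1001, "PEPTIDEK"),   # duplicate of the first row's key
]

counts = Counter(rows)
duplicates = [key for key, n in counts.items() if n > 1]
print(duplicates)
```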

Programmatic validation

You can also validate from Python:

import qpx

with qpx.open("./PXD014414") as ds:
    results = ds.validate()
    for name, result in results.items():
        print(result.summary)
        for issue in result.issues:
            print(f"  [{issue.severity}] {issue.message}")

SDRF Issues

Sample name mismatches

Problem: Samples in data files don't match SDRF sample names.

Solutions:

  1. Ensure source name column in SDRF matches file names (without extension)

  2. Check for whitespace or case sensitivity issues:

import pandas as pd
sdrf = pd.read_csv('experiment.sdrf.tsv', sep='\t')
print(sdrf['source name'].unique())
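Whitespace and case differences are easiest to spot by comparing normalized name sets. This stdlib sketch (the helper is ours, not part of qpx) flags names that match only after stripping and lower-casing:

```python
def near_matches(sdrf_names, data_names):
    """Pairs that differ only by surrounding whitespace or letter case."""
    norm = lambda s: s.strip().lower()
    data_by_norm = {norm(d): d for d in data_names}
    return [(s, data_by_norm[norm(s)]) for s in sdrf_names
            if s not in data_names and norm(s) in data_by_norm]

print(near_matches(["Sample1 ", "sample2"], ["sample1", "Sample2"]))
```

Each pair it returns is an SDRF name and the data-file name it almost matches, so you can fix whichever side is wrong.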

Missing factor values

Problem: Factor values are not extracted from SDRF.

Solution: Ensure factor columns follow the format factor value[factor_name]:

source name    factor value[disease]    factor value[organism part]
sample1        healthy                  liver
sample2        cancer                   liver
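Factor names can be pulled out of the SDRF header with a regular expression matching the factor value[...] pattern, which is a quick way to verify your columns follow the expected format:

```python
import re

FACTOR_RE = re.compile(r"^factor value\[(.+)\]$")

headers = ["source name", "factor value[disease]", "factor value[organism part]"]
factors = [m.group(1) for h in headers if (m := FACTOR_RE.match(h))]
print(factors)   # ['disease', 'organism part']
```

A header that should be a factor but does not appear in the output list is misspelled or missing its brackets.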

Converting SDRF to QPX metadata

Use the dedicated SDRF converter to produce sample.parquet and run.parquet:

qpxc convert sdrf \
    --sdrf-file metadata.sdrf.tsv \
    --output-folder ./output

Query Issues

SQL query errors

Problem: SQL queries fail with column or table not found errors.

Solutions:

  1. Check available structures in the dataset:

    qpxc info --dataset-path ./PXD014414
    

  2. Check the schema for a specific structure:

    qpxc info schema --dataset-path ./PXD014414 --structure feature
    

  3. Use the QPX structure names as SQL table names: psm, feature, pg, mz, sample, run, dataset, ontology, provenance.

Memory issues with large queries

Solutions:

  1. Use --duckdb-memory to increase DuckDB memory:

    qpxc query sql \
        --dataset-path ./PXD014414 \
        --sql "SELECT * FROM feature" \
        --duckdb-memory 32GB
    

  2. Use --limit or SQL LIMIT to restrict results

  3. Export large results directly to Parquet:

    qpxc query sql \
        --dataset-path ./PXD014414 \
        --sql "SELECT * FROM feature" \
        --output results.parquet \
        --output-format parquet
    

Performance Issues

Slow processing

Solutions:

  1. Use SSD storage for input/output files

  2. Increase available RAM

  3. For large datasets, consider processing samples in parallel

  4. Use compressed input files (.gz) to reduce I/O

High memory usage

Solutions:

  1. Close other applications to free memory

  2. Process smaller batches of data

  3. Use streaming/chunked processing where available

Getting More Help

If your issue isn't listed here:

  1. Search existing issues: GitHub Issues

  2. Enable verbose logging: Add --verbose to any command for detailed output

  3. Create a new issue, including:

     - qpx version (qpxc --version)
     - Python version (python --version)
     - Operating system
     - Complete error message
     - Minimal reproducible example