A collection of small scripts dealing with problems of a bioinformatic nature

These scripts are inspired by questions/programming challenges on online forums, e.g. biostars and r/bioinformatics.

multifasta.py

Creates a fasta file from files containing sequencing data (e.g. .seq files), using the filename as fasta header. Copy the script to the folder containing the sequencing files and run it. If your files have a different extension, change the file extension in line 10. Inspired by a question on r/bioinformatics.

questionsmark.py

This is apparently the hardest "easy" programming challenge on coderbyte.com. It checks if there are exactly 3 question marks between every pair of numbers that add up to 10. If that is the case, the output is true, otherwise, the output prints false.

range-of-values.py

This was a question asked on the biostars forum: https://www.biostars.org/p/319990/. It finds the range of numbers in the 3rd column for unique entries in the 1st column for the most common value in the 2nd column.

del_duplicate_names.py

From another biostars question: https://www.biostars.org/p/321641/. The script removes duplicate names (and the respective sequence) from a fasta file, but the sequences for duplicate names are unique. Requires fasta file as input and outputs fasta file as result.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
del_duplicate_names.py		del_duplicate_names.py
fasta_in.fa		fasta_in.fa
file1.txt		file1.txt
file2.txt		file2.txt
multifasta.py		multifasta.py
questionmark.py		questionmark.py
range-of-values.py		range-of-values.py
test_A.seq		test_A.seq
test_B.seq		test_B.seq

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A collection of small scripts dealing with problems of a bioinformatic nature

multifasta.py

questionsmark.py

range-of-values.py

del_duplicate_names.py

About

Releases

Packages

Languages

LisaHagenau/small-bioinformatic-scripts

Folders and files

Latest commit

History

Repository files navigation

A collection of small scripts dealing with problems of a bioinformatic nature

multifasta.py

questionsmark.py

range-of-values.py

del_duplicate_names.py

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages