Hi Stack Overflow Community. I think I'm trying to code the impossible with matplotlib, so if there is another python library that suits me better, please let me know!
I have a whole amino acid sequence (represented as capital letters in the image) of protein (protein x). This will be my x axis.
I have two excel columns: Disease and Control. These columns contain parts of the entire amino acid sequence of protein x. Sometimes there are several hits where the disease or control column will contain two identical amino acid sites of protein x. I want them to stack on top of each other so that you can see how many hits the disease and control have on protein x.
Incomprehensible? Sorry, here is a sample of what I could come up with with powerpoint.
Amino Acid Comparison

Black text is a reference sequence. Violet is a control. Pink is a disease. Do you make sense now?
I need to do this with a huge dataset, so no, I don’t want to “just use powerpoint for hours.” I also want to do this with any reference sequence of my choice.
I do not ask someone to do their work for me. I need someone to point me in the right direction. Is there a special library? Should I convert everything to numbers and then rewrite as text?
Thanks, and I appreciate any advice.
source
share