I have previously used awk commands to extract FASTA sequences based on a separate file of header IDs. However, they do not work for the specific example below.
Input fasta sequence file (seq.fasta)
>106677020 product=phosphatidylinositol 3-kinase catalytic subunit type 3-like
tttaaaaaaaaaaaaaaaaaaaaaaaaaaaaatttgtatactcCCTTCAGTGGCTAAAACCTCAGCTACAGGCAGCGAATCAATGAACTGTAGAAAACCATGTTTAGTGCTAGTGGCAAGAACCCTATATGGTGTAAGTTTCAAGTCCAGGTTTTCACTACGCAATAATTTATCCATCAGAGTGATTATTTGGAGAATAAGTTGATCCTGCCTTAAATCATccccatttttgaaaattgctACATATTCTGTACCAGCTGTGGTTACAAATGTTAATCTTGCAGGCATTAATGCTGACTTGAAAAGTGTAGCTTTCTCAGGTATGATgccttttacatatatatttgaatcTAAAGGAAGCGGTAGTGGTTCGAAgtttgagaaattaattttaaatgtatcagTGTCTGCTAGCAATGCCTGAAGTctatccattttcttttttctattcccACTTTCTCTGGCAATAGTTTTCATCAATTTCACtaatttatcaacaaaattttgttgacgtttaataaaattttgacgaTTTTGCCATTCTGGTGGTCCATATTGTAAAGCTTGCAAAAACATCTTCATAACTTTAAGGTACATGCTGCGAGGTTTATTATCTTGCTTAGCTAAATCATGGTCTTCACATTCAGTTAGCAggtaccaataaaaataattggccAATGTACTGTTTTGACATGCCCTATGAATTAAGAAGGAAGCAAGGTCCATTGTTTTGGAATCTTCTACTGAACTCTCTGTTTTAATGTCAACTGATCCAGAAATactttcttcaataattttagtttgagACGATTTTTTGTAGGCTTCTGAAATATCCtcgaaattttcatatttcagagCTTGCACTAATTGTAAAAGATAAAGGAGTAAATCTTCATCTGGAGCCTTTTGTAGTCTGGTAACAGCGTACCTTCGAACTGATGGGTGGGTAAAATAAGGGCTTAATAACTGAAGGGCATCTTCTACTTCCATCGGTGACCACACATGTAACATATGTATCGCTTGTTTCACTTCACCACTTAGGCTCCAATTCACACATTTTAAGAACTTGGTCAAAGCTTTCTTCTGAGAGCTCAAATAAAATCTGAATTTCCACAGTAGATCTTGCTCTTCAGTAGTAAGTTGTTGCGTAGGAGGATAGGCTACTATCCGATTTAAAGTATCTCTAACTGTAGCATTAGGTTTTAAATCTTTATCTGATAATCCACTGCGCCAACTTCGAGCTAAATTATGATGTTTACTTTCCACTAAATTTTCTTGAAGTATTTCTGGGTCATGGACTGTTACAATATCAGGATGGGCATGGAATTGAAAAACTTCATCTCCATCTTGTTCAAACCAAACTATAGAATAGGGTGTCCCATTGACAGTAGCCTGAGGGAATTCAATCATTAGGtataaatattcagaaattcttttctctttatcatttattttctcaatttcacGGAAGGTCAACCTATCCAACCAATCAATGACAGGCATAAACCCATTTCTATGTTTCTTCGCCAATTTTGCAAGTCTTTGCATATTTTCATTACCTGTACCTGGAGTTTTTCCAGGCGTAGTACAATTTTCTGATCCATCAGCAACAACATCAGGCCACACTTTTAAATCCAACATACCTTGACGAAAAACATTATGCTTACCAAACAATGTAATTGAAGAACCTCCAACTGGTCTCATGGTAGATGGGCCAATACAATCATAAATGGTGATTGCCAATATTGCATTACGTGGCAGGTCAGAATACATTATAGGTAGTGTTAACCACTCGCTCCATGTCCAGCGATTTGTAAAATTCTTGTAAGAAGTCAG
>106677021 product=putative transferase CAF17 homolog%2C mitochondrial
AAAGTGTAtacttaatagttaattttaaagtagTAGACAATTTAAGCTAATCGTGATTTACTTGACATTTCTTGATTATAGCTTAAGTTGctctattgtttataatataattggatgaaagtaattatgatataataatttggaagattctgttaatattttgattttaaattaaggatAATGTTGAGATACCCAGTTTCATGTACATTTTTAGGCTATATCAAAAGAAGTATTTTGCTCAGTTGCAGATACAACCACAGTAAAACCGATTTTCGTTTGGAAGAACTCAATCATCGAAAAATTGTAAGATTGTCTGGAGAAGAATCTTCTAACTTTTTGCAAGGTTTGGTCACTAATGATGTCAATAATATCTCTTCTTCAATGTACAccatgtttttaaacaataggGGGCGGATTTTATTTGACTCGATTATCTACCCTGCTAAAGAGAAAGATACCTTCCTGTTGGAGTGTGATTCTCAAGCTATGCACCAATTAATCAAACATCTAAATATGTATaagctcagaaaaaaaattgcaatctCTCTAGCTTCTGAATTGAATGTATGGTGTATTTACAATCCTAAGCTTGTTGATAATTCAAATGAGGCAAAAGTTTCTTCTACCGAGACATTTGATATGAATGctgttgataaaaatttaatgattacacCCGACCCTCGAACTAATTTGTTAGGTTACCGTATTATTGCCAAAGAAGGTGATGAAATACCTAATTTGCCTAAAAGTGATTTGTATACACTATGCCGATATAAATTAGGCATTGGTGAAGGCATTGACGAACTTCTCTTCGAACAAAGTTTTCCTCTTGAAATGAACTGTGATTACCTTAATGGTGTATCTTTTAACAAGGGATGTTACATTGGACAAGAACTGACTGCAAGATCATTTCATACTGGTGTAATTAGGAAACGGTTAATGCCTCTTATTTTTGAGTCTGAGGCACTCGGTATTCCAATAAATACACCTATCGAAGATCCTAATATTACTAGAAAGTCACCCATCGGAAAAGTTAGGACTGTTAAAGGTGTGAATGGTATTGGGTTGATGCGTGTATCCGAAACAATAGAatctaaatctttaaaaataattaattttatggcaAGATCTTATATTCCAGGTTGGTGGCCTGAAGAAACTGTAGAGCAAAcatatgtcaaaataaaaaaataatttattgattatttgta
>106677022 product=uncharacterized LOC106677022
TACAGTTTAAATAGGAGGCAATCTAGTTCCAACGGTCGCAGTACCCCGCCTAGACAAACCCACACCGTTGCCAACATGAAATTCGCTATT
GTTTGCCTGTTGGCAGTCAGTGCAGTGAGCGCCTCTCGCTACAGGAGGTCCCTCGTCGGATGGCCGCTTGGTCTAGCTAGCCACGGAGCGGTCGCTGTAGGACTCTCCCATCCTGGAGCAGTGGCCGTTGGCCTGTCCCATCCAGGAGCAGTAGCCATTGGACCTTCCCACACCGGGTCTGTAGCTGTAGGACCATCACATACCGGATCCATTGCTGTCGGACCATCCCACACAGGATCGATCGCCGTTGGACCTTCCCATACTGGATCAATAGCTGTCGGACCATCCCATACCGGATCAGTAGCTATCGGACCATCTCACACCGGGGCTGTCGTCGCTCCAGGTGTGGTCTTAGCAGCTCCCGCCATTGCTGCACCCCTCATCGCTCCAGTGGCTCCAGCCCTTGCTTTTGGACCCCATGTTGGTCTCCTTGGACTTCATGGAATCCATGGTTAGCTGtctcaaattaattaacattaactaataaagtaaaattttatgacaAATATTCTGCCAAATCTGTTACGTTTGTCTTATGTACAAGtcttgtaaattttagtaaataaatataatcatgtaTCAGTACTACCCAATTATGACAAATACGCCAatataaacaatgta
Input ID file (id.txt)
106677020
106677022
Expected output (output.fasta)
>106677020 product=phosphatidylinositol 3-kinase catalytic subunit type 3-like
tttaaaaaaaaaaaaaaaaaaaaaaaaaaaaatttgtatactcCCTTCAGTGGCTAAAACCTCAGCTACAGGCAGCGAATCAATGAACTGTAGAAAACCATGTTTAGTGCTAGTGGCAAGAACCCTATATGGTGTAAGTTTCAAGTCCAGGTTTTCACTACGCAATAATTTATCCATCAGAGTGATTATTTGGAGAATAAGTTGATCCTGCCTTAAATCATccccatttttgaaaattgctACATATTCTGTACCAGCTGTGGTTACAAATGTTAATCTTGCAGGCATTAATGCTGACTTGAAAAGTGTAGCTTTCTCAGGTATGATgccttttacatatatatttgaatcTAAAGGAAGCGGTAGTGGTTCGAAgtttgagaaattaattttaaatgtatcagTGTCTGCTAGCAATGCCTGAAGTctatccattttcttttttctattcccACTTTCTCTGGCAATAGTTTTCATCAATTTCACtaatttatcaacaaaattttgttgacgtttaataaaattttgacgaTTTTGCCATTCTGGTGGTCCATATTGTAAAGCTTGCAAAAACATCTTCATAACTTTAAGGTACATGCTGCGAGGTTTATTATCTTGCTTAGCTAAATCATGGTCTTCACATTCAGTTAGCAggtaccaataaaaataattggccAATGTACTGTTTTGACATGCCCTATGAATTAAGAAGGAAGCAAGGTCCATTGTTTTGGAATCTTCTACTGAACTCTCTGTTTTAATGTCAACTGATCCAGAAATactttcttcaataattttagtttgagACGATTTTTTGTAGGCTTCTGAAATATCCtcgaaattttcatatttcagagCTTGCACTAATTGTAAAAGATAAAGGAGTAAATCTTCATCTGGAGCCTTTTGTAGTCTGGTAACAGCGTACCTTCGAACTGATGGGTGGGTAAAATAAGGGCTTAATAACTGAAGGGCATCTTCTACTTCCATCGGTGACCACACATGTAACATATGTATCGCTTGTTTCACTTCACCACTTAGGCTCCAATTCACACATTTTAAGAACTTGGTCAAAGCTTTCTTCTGAGAGCTCAAATAAAATCTGAATTTCCACAGTAGATCTTGCTCTTCAGTAGTAAGTTGTTGCGTAGGAGGATAGGCTACTATCCGATTTAAAGTATCTCTAACTGTAGCATTAGGTTTTAAATCTTTATCTGATAATCCACTGCGCCAACTTCGAGCTAAATTATGATGTTTACTTTCCACTAAATTTTCTTGAAGTATTTCTGGGTCATGGACTGTTACAATATCAGGATGGGCATGGAATTGAAAAACTTCATCTCCATCTTGTTCAAACCAAACTATAGAATAGGGTGTCCCATTGACAGTAGCCTGAGGGAATTCAATCATTAGGtataaatattcagaaattcttttctctttatcatttattttctcaatttcacGGAAGGTCAACCTATCCAACCAATCAATGACAGGCATAAACCCATTTCTATGTTTCTTCGCCAATTTTGCAAGTCTTTGCATATTTTCATTACCTGTACCTGGAGTTTTTCCAGGCGTAGTACAATTTTCTGATCCATCAGCAACAACATCAGGCCACACTTTTAAATCCAACATACCTTGACGAAAAACATTATGCTTACCAAACAATGTAATTGAAGAACCTCCAACTGGTCTCATGGTAGATGGGCCAATACAATCATAAATGGTGATTGCCAATATTGCATTACGTGGCAGGTCAGAATACATTATAGGTAGTGTTAACCACTCGCTCCATGTCCAGCGATTTGTAAAATTCTTGTAAGAAGTCAG
>106677022 product=uncharacterized LOC106677022
TACAGTTTAAATAGGAGGCAATCTAGTTCCAACGGTCGCAGTACCCCGCCTAGACAAACCCACACCGTTGCCAACATGAAATTCGCTATT
GTTTGCCTGTTGGCAGTCAGTGCAGTGAGCGCCTCTCGCTACAGGAGGTCCCTCGTCGGATGGCCGCTTGGTCTAGCTAGCCACGGAGCGGTCGCTGTAGGACTCTCCCATCCTGGAGCAGTGGCCGTTGGCCTGTCCCATCCAGGAGCAGTAGCCATTGGACCTTCCCACACCGGGTCTGTAGCTGTAGGACCATCACATACCGGATCCATTGCTGTCGGACCATCCCACACAGGATCGATCGCCGTTGGACCTTCCCATACTGGATCAATAGCTGTCGGACCATCCCATACCGGATCAGTAGCTATCGGACCATCTCACACCGGGGCTGTCGTCGCTCCAGGTGTGGTCTTAGCAGCTCCCGCCATTGCTGCACCCCTCATCGCTCCAGTGGCTCCAGCCCTTGCTTTTGGACCCCATGTTGGTCTCCTTGGACTTCATGGAATCCATGGTTAGCTGtctcaaattaattaacattaactaataaagtaaaattttatgacaAATATTCTGCCAAATCTGTTACGTTTGTCTTATGTACAAGtcttgtaaattttagtaaataaatataatcatgtaTCAGTACTACCCAATTATGACAAATACGCCAatataaacaatgta
I have tried all of the following awk commands, including some that assume sequence data; most come from other posts that set out to do the same thing. The first command has always worked for me in the past, and I can't see why it fails in this particular instance. All I get is an empty output file:
awk -F'>' 'NR==FNR{ids[$0]; next} NF>1{f=($2 in ids)} f' id.txt seq.fasta > output.fasta
awk 'NR==FNR{ids[$0];next} /^>/{f=($1 in ids)} f' id.txt seq.fasta > output.fasta
awk -F'>' 'NR==FNR{ids[$0];next} /^>/{f=($1 in ids)} f' id.txt seq.fasta > output.fasta
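A possible explanation, offered as a sketch rather than a confirmed diagnosis: every header here carries a description after the ID (`>106677020 product=...`). With `-F'>'`, `$2` is the whole string `106677020 product=...`; with the default field separator, `$1` is `>106677020`, including the `>`; and with `-F'>'` on a line that starts with `>`, `$1` is empty. None of these equals a bare ID from id.txt, so the flag `f` is never set and nothing is printed. The variant below compares only the first whitespace-separated token of the header with the leading `>` stripped. The `demo_*` file names and the shortened sequences are hypothetical stand-ins so the example does not overwrite real data, and it assumes the ID file has one ID per line with Unix line endings.

```shell
# Hypothetical miniature inputs with the same header layout as seq.fasta / id.txt
printf '%s\n' \
  '>106677020 product=a' 'ACGT' \
  '>106677021 product=b' 'GGCC' \
  '>106677022 product=c' 'TTAA' 'CCGG' > demo_seq.fasta
printf '%s\n' '106677020' '106677022' > demo_id.txt

# First file: store each ID as an array key.
# Header lines: set f to 1 if the first token, minus the leading ">", is a stored ID.
# Every line: print while f is set, so multi-line sequences are kept whole.
awk 'NR==FNR { ids[$1]; next }
     /^>/    { f = (substr($1, 2) in ids) }
     f' demo_id.txt demo_seq.fasta > demo_out.fasta

cat demo_out.fasta
```

If this behaves as expected on the small files, the same awk program should apply to the real data by substituting id.txt, seq.fasta and output.fasta for the demo names.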