If you notice, there is a pattern by which elements are skipped. This is most evident in a long section of characters without an accent: it skips every other element.
, re.findall()
, , . , .
(\b
) / :
>>> re.findall(r'\b[A-Z]{3}\b[.!?]?',example)
['FAB.', 'APL', 'APL?', 'GJA', 'ADJ', 'AKE', 'EBY', 'ZKE', 'SPR', 'TYL']