I have an app that downloads thousands of XML files every night. Many of these have small errors ( they come from an encrypted source ) that Im trying to clean up before I parse them. Each file is checked line by line for noise.
If I have n possible tags in a file with a different list of attributes for each, is there a regex that could be used to look for missing attributes? IE if I have <cat tail='text' meow='text'/> and <dog tail='text' bark='text'/> can I find instances of 'cat' that don't have 'meow' without discarding 'dog' entries? (assume that each line in the file is single XML statement (IE its closed)) and tags aren't nested.)