I have a list and I want to find (in this particular case the first) appearance of a any of some subsequences, of possible different lengths. None of the subsequences is a subsequence of each other. In my particular case I could do this translating the list to a string and using StringPosition
. But I could do it because all elements on my list were one-character-long. Before realizing this I had implemented a not-nearly-one-liner that did the trick without recurring to Strings. It didn't do any useless comparison but it did lots of useless coping of the list as a whole, and it turned out to be 50 times slower than the StringPosition
version. It can be improved, avoiding that issue, making it even less one-liner. The task just seems too easy to describe so as to be so not-easy to program well... Is there an efficient way to do it for the general case? "Find the first appearance of one of many subsequences (possible different lengths, perhaps could be patterns, or not) in a list"
(Wow, I think I just thought of a good way, I'll give it a shot... If it works I'll auto-answer. But I'd still like your input, I'm afraid I'm missing some options)
Answer
I asked the same question on StackOverflow recently, and the answer that is now my favourite came from Jan Pöschko (modified):
findSubsequence[list_, {ss__}] :=
ReplaceList[list, {pre___, ss, ___} :> Length[{pre}] + 1]
This will find all positions of ss
in list
. Example:
findSubsequence[Range[50] ~Mod~ 17, {4, 5, 6}]
{4, 21, 38}
Despite using patterns, this solution runs very quickly, even for packed arrays. Please see the question I linked to for more possibilities.
A potentially useful generalization to other heads may be had with:
findSubsequence[list : h_[__], _[ss__]] :=
ReplaceList[list, h[pre___, ss, ___] :> Length[{pre}] + 1]
Allowing such forms as:
x = Hold[1 + 1, 2 + 1, 3 + 1, 4 + 1, 2 + 1, 3 + 1, 1 + 1, 2 + 1, 3 + 1];
findSubsequence[x, Hold[2 + 1, 3 + 1]]
{2, 5, 8}
Comments
Post a Comment