Skip to main content

gathering - Collecting lists using Reap/Sow


I have a list like {{a,x},{b,y},{c,x},{d,z},{e,y}} and would like to produce from it a list formed by collecting all top-level elements whose second components are the same into a single list. For the list above, the desired output would be { {{a,c},x}, {{b,e},y}, {d, z}}. Here x, y, and z could also be lists, if that matters. This seems like an application of Reap and Sow. But the on-line documentation of these functions is pretty poor, and the examples don't add much. Shifrin's book keeps referring to a discussion of Reap and Sow in Part II, but I can't actually find a part II. Is there a resource somewhere that I can look at to understand the ins and outs of these functions, and see some real examples of how to use them?



The simple Reap[Map[Sow[#, #[[2]]] &, r]], where r is the list above, produces almost what I want, but instead of {{a,c},x} above, it produces {{a,x},{c,x}}.


(I'm sure that someone will post a solution to my programming problem; for that I would be grateful. But I really would like to understand these functions better.)


EDIT: The code Reap[Map[Sow[#[[1]], #[[2]]] &, r], _, {#2, #1} &] // Rest does the trick. Perhaps there's a better way. In any case, my question about documentation still stands.



Answer



This question may be deemed a duplicate, as the specific operation has been addressed before (actually quite a few times as I recall). Nevertheless I shall attempt to provide a useful answer regarding Sow and Reap. First, I there are many examples of the use of these functions on this site, and I encourage you to search for them, as they will provide a broader application than is practical in a single answer.


For your given example you could write:


data = {{a, x}, {b, y}, {c, x}, {d, z}, {e, y}};

Reap[Sow @@@ data, _, {#2, #} &][[2]]



{{{a, c}, x}, {{b, e}, y}, {{d}, z}}

This doesn't quite match your desired output but it is a more consistent format.


However, you mention: "Here x, y, and z could also be lists, if that matters." Yes, it does matter, because if the second argument of Sow is a list it is not treated as a single tag but a list of tags. Therefore you need to wrap the list itself in {} to have it treated as a single tag. First an example of the failure:


x = {1, 2, 3};

Reap[Sow @@@ data, _, {#2, #} &][[2]]



{{{a, c}, 1}, {{a, c}, 2}, {{a, c}, 3}, {{b, e}, y}, {{d}, z}}

And then the correction:


Reap[Sow[#, {#2}] & @@@ data, _, {#2, #} &][[2]]


{{{a, c}, {1, 2, 3}}, {{b, e}, y}, {{d}, z}}



A few posts you should read:




Perhaps also of interest:





Version 10 update: although specifically Reap and Sow were requested I think it is worth noting that this can be done with Associations as well, e.g.:


{#2, #} & @@@ Normal @ GroupBy[data, Last -> First]


{{{a, c}, x}, {{b, e}, y}, {{d}, z}}


Comments

Popular posts from this blog

functions - Get leading series expansion term?

Given a function f[x] , I would like to have a function leadingSeries that returns just the leading term in the series around x=0 . For example: leadingSeries[(1/x + 2)/(4 + 1/x^2 + x)] x and leadingSeries[(1/x + 2 + (1 - 1/x^3)/4)/(4 + x)] -(1/(16 x^3)) Is there such a function in Mathematica? Or maybe one can implement it efficiently? EDIT I finally went with the following implementation, based on Carl Woll 's answer: lds[ex_,x_]:=( (ex/.x->(x+O[x]^2))/.SeriesData[U_,Z_,L_List,Mi_,Ma_,De_]:>SeriesData[U,Z,{L[[1]]},Mi,Mi+1,De]//Quiet//Normal) The advantage is, that this one also properly works with functions whose leading term is a constant: lds[Exp[x],x] 1 Answer Update 1 Updated to eliminate SeriesData and to not return additional terms Perhaps you could use: leadingSeries[expr_, x_] := Normal[expr /. x->(x+O[x]^2) /. a_List :> Take[a, 1]] Then for your examples: leadingSeries[(1/x + 2)/(4 + 1/x^2 + x), x] leadingSeries[Exp[x], x] leadingSeries[(1/x + 2 + (1 - 1/x...

mathematical optimization - Minimizing using indices, error: Part::pkspec1: The expression cannot be used as a part specification

I want to use Minimize where the variables to minimize are indices pointing into an array. Here a MWE that hopefully shows what my problem is. vars = u@# & /@ Range[3]; cons = Flatten@ { Table[(u[j] != #) & /@ vars[[j + 1 ;; -1]], {j, 1, 3 - 1}], 1 vec1 = {1, 2, 3}; vec2 = {1, 2, 3}; Minimize[{Total@((vec1[[#]] - vec2[[u[#]]])^2 & /@ Range[1, 3]), cons}, vars, Integers] The error I get: Part::pkspec1: The expression u[1] cannot be used as a part specification. >> Answer Ok, it seems that one can get around Mathematica trying to evaluate vec2[[u[1]]] too early by using the function Indexed[vec2,u[1]] . The working MWE would then look like the following: vars = u@# & /@ Range[3]; cons = Flatten@{ Table[(u[j] != #) & /@ vars[[j + 1 ;; -1]], {j, 1, 3 - 1}], 1 vec1 = {1, 2, 3}; vec2 = {1, 2, 3}; NMinimize[ {Total@((vec1[[#]] - Indexed[vec2, u[#]])^2 & /@ R...

How to remap graph properties?

Graph objects support both custom properties, which do not have special meanings, and standard properties, which may be used by some functions. When importing from formats such as GraphML, we usually get a result with custom properties. What is the simplest way to remap one property to another, e.g. to remap a custom property to a standard one so it can be used with various functions? Example: Let's get Zachary's karate club network with edge weights and vertex names from here: http://nexus.igraph.org/api/dataset_info?id=1&format=html g = Import[ "http://nexus.igraph.org/api/dataset?id=1&format=GraphML", {"ZIP", "karate.GraphML"}] I can remap "name" to VertexLabels and "weights" to EdgeWeight like this: sp[prop_][g_] := SetProperty[g, prop] g2 = g // sp[EdgeWeight -> (PropertyValue[{g, #}, "weight"] & /@ EdgeList[g])] // sp[VertexLabels -> (# -> PropertyValue[{g, #}, "name"]...