Skip to main content

import - How do you skip missing data values in a data file?


In scanning through a simple imported file of weather station data, some dates have missing data.


What is a test to use to check whether a datum is valid? I tried If[tempData[index][[4]]≠NaN, ] and If[tempData[index][[4]]≠Indeterminate, ], but neither seemed to work.


Most of my data is valid, and so it is just a matter of skipping/ignoring the occasional missing entry, and not incrementing my accumulators or counters.


The data are already arranged in a Mathematica list, and a missing entry is represented by two adjacent commas with nothing in between.


So, what I need is an If[ ] statement that will trigger (or not trigger) when it encounters such a missing data element.


Here is the code, along with a first snippet of output:


f[x_ /; MemberQ[Range@12, x]] := 
Switch[x, 2, 28, 4 | 6 | 9 | 11, 30, _, 31]


Do[

Do[ sum = 0.0; count = 0; index = 1;

While[index < 20454,

If[Part[tempData, index][[{2, 3}]] == {month, day},

sum = sum + Part[tempData, index][[4]]; count++ ]; index++];


Print[{month, day}, " ", sum, " ", count],

{day, 1, f[month]}

],

{month, 1, 12}

]


{1,1} 87.5 +5 56

{1,2} 60.4 +5 56

{1,3} 59.5 +5 56

{1,4} 54.5 +5 56

{1,5} 37.4 +5 56


{1,6} 39.6 +5 56

I interpret these results to mean that Mathematica found, for these six days in January, the sums of the temperatures for the corresponding days in the 20454 days of data (about 56 years) contained in tempData which had valid data. I think the +5 means that the program encountered 5 Null values, so that the data count should really be 51. If I wanted an average temperature for 01 January based on these data, I would divide 87.5 by 51.


My main concern is not computing the average, or any of the other manipulations I plan to do, but merely how to skip the Null data values. My most recent attempt was to add


If[!Null === Part[tempData, index][[4]], sum = sum + Part[tempData, index][[4]], count++]

but that did worse than not work — the sums and counts are all zero.


Here is a sample of the input data, tempData:


{{1955, 1, 1, 0.4, 1.9, -0.3, 4.2},


{1955, 1, 2, -0.5, 0.5, -1.1, 2.5},

{1955, 1, 3, 0.5, 2.7, -1.9, 0.8},

{1955, 1, 4, 1.4, 2.6, 0.6,"0T"},

{1955, 1, 5, 0.8, 2.3, -0.3, 0},

{1955, 1, 6, 0.7, 3.7, -1.7, 0}, etc. }



Comments

Popular posts from this blog

front end - keyboard shortcut to invoke Insert new matrix

I frequently need to type in some matrices, and the menu command Insert > Table/Matrix > New... allows matrices with lines drawn between columns and rows, which is very helpful. I would like to make a keyboard shortcut for it, but cannot find the relevant frontend token command (4209405) for it. Since the FullForm[] and InputForm[] of matrices with lines drawn between rows and columns is the same as those without lines, it's hard to do this via 3rd party system-wide text expanders (e.g. autohotkey or atext on mac). How does one assign a keyboard shortcut for the menu item Insert > Table/Matrix > New... , preferably using only mathematica? Thanks! Answer In the MenuSetup.tr (for linux located in the $InstallationDirectory/SystemFiles/FrontEnd/TextResources/X/ directory), I changed the line MenuItem["&New...", "CreateGridBoxDialog"] to read MenuItem["&New...", "CreateGridBoxDialog", MenuKey["m", Modifiers-...

How to thread a list

I have data in format data = {{a1, a2}, {b1, b2}, {c1, c2}, {d1, d2}} Tableform: I want to thread it to : tdata = {{{a1, b1}, {a2, b2}}, {{a1, c1}, {a2, c2}}, {{a1, d1}, {a2, d2}}} Tableform: And I would like to do better then pseudofunction[n_] := Transpose[{data2[[1]], data2[[n]]}]; SetAttributes[pseudofunction, Listable]; Range[2, 4] // pseudofunction Here is my benchmark data, where data3 is normal sample of real data. data3 = Drop[ExcelWorkBook[[Column1 ;; Column4]], None, 1]; data2 = {a #, b #, c #, d #} & /@ Range[1, 10^5]; data = RandomReal[{0, 1}, {10^6, 4}]; Here is my benchmark code kptnw[list_] := Transpose[{Table[First@#, {Length@# - 1}], Rest@#}, {3, 1, 2}] &@list kptnw2[list_] := Transpose[{ConstantArray[First@#, Length@# - 1], Rest@#}, {3, 1, 2}] &@list OleksandrR[list_] := Flatten[Outer[List, List@First[list], Rest[list], 1], {{2}, {1, 4}}] paradox2[list_] := Partition[Riffle[list[[1]], #], 2] & /@ Drop[list, 1] RM[list_] := FoldList[Transpose[{First@li...

plotting - How to draw lines between specified dots on ListPlot?

I would like to create a plot where I have unconnected dots and some connected. So far, I have figured out how to draw the dots. My code is the following: ListPlot[{{1, 1}, {2, 2}, {3, 3}, {4, 4}, {1, 4}, {2, 5}, {3, 6}, {4, 7}, {1, 7}, {2, 8}, {3, 9}, {4, 10}, {1, 10}, {2, 11}, {3, 12}, {4,13}, {2.5, 7}}, Ticks -> {{1, 2, 3, 4}, None}, AxesStyle -> Thin, TicksStyle -> Directive[Black, Bold, 12], Mesh -> Full] I have thought using ListLinePlot command, but I don't know how to specify to the command to draw only selected lines between the dots. Do have any suggestions/hints on how to do that? Thank you. Answer One possibility would be to use Epilog with Line : ListPlot[ {{1, 1}, {2, 2}, {3, 3}, {4, 4}, {1, 4}, {2, 5}, {3, 6}, {4, 7}, {1, 7}, {2, 8}, {3, 9}, {4, 10}, {1, 10}, {2, 11}, {3, 12}, {4, 13}, {2.5, 7}}, Ticks -> {{1, 2, 3, 4}, None}, AxesStyle -> Thin, TicksStyle -> Directive[Black, Bold, 12], Mesh -> Full, Epilog -> { Line[ ...