Skip to main content

security - Do Mathematica notebooks include personal metadata?



I am about to send a Mathematica notebook to someone and I would like to make sure it does not contain any information that could personally identify me. I would also like to produce pdf files from the notebook and would like them to be metadata-free. Do I need to 'scrub' the file from some intrinsic metadata (as you would need to do, carefully, say, for an MS Word file), or do they generically not include any such information?


If they do contain metadata,



  • what can it be?

  • where is it? and

  • how can I scrub it?



Answer



On the encouragement of Kuba, I will collect in an answer the multiple comments (with due credit to rm -rf, Jagra, and Daniel Lichtblau) that do, to a large extent, answer the question. For me, the most important points are the following:





  • Mathematica notebooks do not, by themselves, store explicit personally identifying information such as author names, as you might get with e.g. MS Word, and pdfs printed from it.




  • However, there can be certain types of information in the notebook (including but not limited to cell editing times, custom style sheet locations, and the file path or filename of the notebook) that can show up in the notebook or pdfs printed from it, and which can reveal traits (such as timezone / active times, operating system, or the very use of Mathematica itself) which, depending on context, can end up identifying you. These features can be disabled as follows:



    • To remove the cell editing times,

      • go to Cell $\to$ Notebook History, and

      • click Clear History.




    • To prevent Mathematica from recording cell editing times,

      • go to Cell $\to$ Notebook History, and

      • uncheck the Track Notebook History option.



    • To remove the file outline cache,


      • go to Format $\to$ Options Inspector,

      • choose Show Option Values $\to$ Selected Notebook,

      • go to Notebook Options $\to$ PrivateNotebookOptions, and

      • change "FileOutlineCache" to False.




    Doing this will make the notebook much easier to read in a file editor, and therefore help in finding any further information you might wish weren't there.





  • Most importantly, notebooks can be opened externally with any text editor; with the exception of Graphics objects they are quite transparent to read. Thus, if this is a concern, you can and should have a good look at the notebook to see if there's anything there that you'd rather not be.




For the examples above, disabling CellChangeTimes will remove cell editing times from the notebook; turning off FileOutlineCache in the options inspector will leave only the notebook expressions behind, though paths to the stylesheets will remain. If you print a pdf from the notebook, it may still be a good idea to run it through a pdf metadata removal tool such as this one if metadata is an issue, just to be sure. The notebooks themselves, though, will have nothing in them that's not visible through a text editor.


Comments

Popular posts from this blog

front end - keyboard shortcut to invoke Insert new matrix

I frequently need to type in some matrices, and the menu command Insert > Table/Matrix > New... allows matrices with lines drawn between columns and rows, which is very helpful. I would like to make a keyboard shortcut for it, but cannot find the relevant frontend token command (4209405) for it. Since the FullForm[] and InputForm[] of matrices with lines drawn between rows and columns is the same as those without lines, it's hard to do this via 3rd party system-wide text expanders (e.g. autohotkey or atext on mac). How does one assign a keyboard shortcut for the menu item Insert > Table/Matrix > New... , preferably using only mathematica? Thanks! Answer In the MenuSetup.tr (for linux located in the $InstallationDirectory/SystemFiles/FrontEnd/TextResources/X/ directory), I changed the line MenuItem["&New...", "CreateGridBoxDialog"] to read MenuItem["&New...", "CreateGridBoxDialog", MenuKey["m", Modifiers-...

How to thread a list

I have data in format data = {{a1, a2}, {b1, b2}, {c1, c2}, {d1, d2}} Tableform: I want to thread it to : tdata = {{{a1, b1}, {a2, b2}}, {{a1, c1}, {a2, c2}}, {{a1, d1}, {a2, d2}}} Tableform: And I would like to do better then pseudofunction[n_] := Transpose[{data2[[1]], data2[[n]]}]; SetAttributes[pseudofunction, Listable]; Range[2, 4] // pseudofunction Here is my benchmark data, where data3 is normal sample of real data. data3 = Drop[ExcelWorkBook[[Column1 ;; Column4]], None, 1]; data2 = {a #, b #, c #, d #} & /@ Range[1, 10^5]; data = RandomReal[{0, 1}, {10^6, 4}]; Here is my benchmark code kptnw[list_] := Transpose[{Table[First@#, {Length@# - 1}], Rest@#}, {3, 1, 2}] &@list kptnw2[list_] := Transpose[{ConstantArray[First@#, Length@# - 1], Rest@#}, {3, 1, 2}] &@list OleksandrR[list_] := Flatten[Outer[List, List@First[list], Rest[list], 1], {{2}, {1, 4}}] paradox2[list_] := Partition[Riffle[list[[1]], #], 2] & /@ Drop[list, 1] RM[list_] := FoldList[Transpose[{First@li...

plotting - How to draw lines between specified dots on ListPlot?

I would like to create a plot where I have unconnected dots and some connected. So far, I have figured out how to draw the dots. My code is the following: ListPlot[{{1, 1}, {2, 2}, {3, 3}, {4, 4}, {1, 4}, {2, 5}, {3, 6}, {4, 7}, {1, 7}, {2, 8}, {3, 9}, {4, 10}, {1, 10}, {2, 11}, {3, 12}, {4,13}, {2.5, 7}}, Ticks -> {{1, 2, 3, 4}, None}, AxesStyle -> Thin, TicksStyle -> Directive[Black, Bold, 12], Mesh -> Full] I have thought using ListLinePlot command, but I don't know how to specify to the command to draw only selected lines between the dots. Do have any suggestions/hints on how to do that? Thank you. Answer One possibility would be to use Epilog with Line : ListPlot[ {{1, 1}, {2, 2}, {3, 3}, {4, 4}, {1, 4}, {2, 5}, {3, 6}, {4, 7}, {1, 7}, {2, 8}, {3, 9}, {4, 10}, {1, 10}, {2, 11}, {3, 12}, {4, 13}, {2.5, 7}}, Ticks -> {{1, 2, 3, 4}, None}, AxesStyle -> Thin, TicksStyle -> Directive[Black, Bold, 12], Mesh -> Full, Epilog -> { Line[ ...