Hi Joseph,

I must confess I do not understand at all what the point of dumping on yt at all in this paper was. It’s a very nice paper which shows some cool science and highlights the importance of HPC in running the simulation and visualizing it. If you want to highlight how useful Paraview and/or yt were in visualization, that also fits in nicely with the goals of the paper. But going off on how long it took for one tool vs another to load a file or the inability of one tool to do something that it simply doesn’t have the capability to do (which is the case for many tools) seems incredibly off-topic.

It seems like you were just looking for a venue to comment on yt’s shortcomings and chose this one. If you want to do that, that’s fine, because it is completely valid to point out where software falls short and make suggestions for improvement. But why on earth you felt like an article such as this was the right forum is beyond me. 

Best,

John

On Apr 8, 2019, at 11:39 AM, Joseph Smidt <josephsmidt@gmail.com> wrote:

Hey everyone,

    This tweet [1] was brought to my attention.  There is good criticism in that thread so let me respond:

1. It is ridiculous the YT paper is not cited. It defiantly was last I checked. How that was dropped is beyond me. I will make sure that is somehow fixed.

2. The "single slice plot of a single time step took 13 minutes" is also misleading and is a mistake that I will make sure is somehow corrected.  I ran the simulations and wrote those sections and someone else ran the plotting routines - who is admittedly a Paraview developer - and I should have checked this text more closely.  They were running both YT and Paraview from their Apple Macbook.  The 13 minutes was how long it took that Macbook to volume render the dump versus Paraview,  also running on a Macbook but using an HPC set of nodes in the backend.  It's a feature of Paraview that you can do this. 

    Thus it is very misleading, I am embarrassed I missed it, and will make sure it is corrected. 

3. That said, there are some real deficiencies in YT for HPC-class datasets

  -  The ability to overlay several variables using different color schemes on the same plot. There are times when having a temperature field and metallically field - for example -  overlaying a density field is helpful.  This was mentioned in the paper. 

  - The ability to volume render interactively until you get a plot exactly how you want it. With YT you have to keep running a script over and over again until it looks right.  With paraview - even with a 26 Tb dump on our classified systems - files can be open and rotated and modified in real time to be exactly how you want in like 5 minutes.  This was the cumbersome comment.

  -  Paraview's ability to open up a huge file - again, we have a 26 terabyte file on our classified side we used Paraview - and let the clusters do all the work in minutes from the comfort of your personal Macbook.   A file that YT was unable to ever fit in memory for volume rendering no matter how many nodes we tried.   This failure was not mentioned in the paper, even though some working with such huge datsets may be interested.

  - Another omission from the paper is we did the same thing with gadget data, that is arguably used more than Enzo.  Long story short, YT could never volume render the data, a known issue for at least 5 years [2], whereas paraview both reads and volume renders gadget data natively.  Again, there is no reason to point this out in print even though some would be interested. 

  - The issue with Paraview not reading Enzo natively was true at the time of the analysis but is no longer true.  There is a development version by the above developer that reads Enzo. 

  -  Some of you work with Alex Gagliano. Though he has been working on this water network, he has nothing to do with how any of this paper is worded.  Please do not give him grief for what is in it.  He has a future paper on the subject and I guarantee these problems will not be repeated. 

   Anyway, I think YT is a fine product. I use it almost every day and encourage all students to use it. I am embarrassed this was lost from the paper and will work to restore it.  But at the same time, to deny that Paraview comes with many advantages - especially as the world becomes more huge 3D TB sized datasets - is a little naive. 



[2] http://lists.spacepope.org/pipermail/yt-users-spacepope.org/2014-June/005272.html

--
------------------------------------------------------------------------
Joseph Smidt <josephsmidt@gmail.com>

Theoretical Division
P.O. Box 1663, Mail Stop B283
Los Alamos, NM 87545
Office: 505-665-9752
Fax:    505-667-1931
_______________________________________________
yt-dev mailing list -- yt-dev@python.org
To unsubscribe send an email to yt-dev-leave@python.org