Re: [matplotlib-devel] Better defaults all around?
On Nov 26, 2014 10:04 PM, Nathaniel Smith n...@pobox.com wrote: On Wed, Nov 26, 2014 at 9:30 AM, Todd toddr...@gmail.com wrote: On Sat, Nov 22, 2014 at 12:22 AM, Nathaniel Smith n...@pobox.com wrote: - Default line colors: The rgbcmyk color cycle for line plots doesn't appear to be based on any real theory about visualization -- it's just the corners of the RGB color cube, which is a highly perceptually non-uniform space. The resulting lines aren't terribly high contrast against the default white background, and the different colors have varying luminance that makes some lines pop out more than others. Seaborn's default is to use a nice isoluminant variant on matplotlib's default: http://web.stanford.edu/~mwaskom/software/seaborn/tutorial/aesthetics.html ggplot2 uses isoluminant colors with maximally-separated hues, which also works well. E.g.: http://www.cookbook-r.com/Graphs/Colors_%28ggplot2%29/ggplot2_scale_hue_colors_l45.png About this, I am not expert so forgive me if this is nonsensical. However, it would seem to me that these requirements are basically the same as the requirements for the new default colormap that prompted this whole discussion. So, rather than create two inconsistent set of colors that accomplish similar goals, might it be better to instead use the default colormap for the line colors? You could pick N equally-spaced colors from the colormap and use those as the line colors. The main differences in requirements are: - for the color cycle, you want isoluminant colors, to avoid the issue where one line is glaring bright red and one is barely-visible-grey. For general-purpose 2d colormaps, though, you almost always want the luminance to vary to help distinguish colors from each other. If you used isoluminance colors for the lines, wouldn't that mean a plot printed in grayscale would have all lines be the same shade of gray? -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___ Matplotlib-devel mailing list Matplotlib-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/matplotlib-devel
Re: [matplotlib-devel] Better defaults all around?
On Thu, Nov 27, 2014 at 9:54 AM, Todd toddr...@gmail.com wrote: On Nov 26, 2014 10:04 PM, Nathaniel Smith n...@pobox.com wrote: The main differences in requirements are: - for the color cycle, you want isoluminant colors, to avoid the issue where one line is glaring bright red and one is barely-visible-grey. For general-purpose 2d colormaps, though, you almost always want the luminance to vary to help distinguish colors from each other. If you used isoluminance colors for the lines, wouldn't that mean a plot printed in grayscale would have all lines be the same shade of gray? Yes. But IME it's very difficult to use greyscale alone to distinguish between multiple plot lines no matter what: you can't go much beyond 2 lines before you either end up with hard-to-see lines (b/c they don't have enough contrast with the white background) or the lines become nigh-indistinguishable (which one is the slightly-darker grey?). And if you have substantial luminance variation to make the greyscale work, then the color images end up looking really weird (the scarlet versus faint-yellow problem, where you end up emphasizing one set of data over another -- emphasis should be done on purpose! in matplotlib's current color cycle the yellow and cyan tend to disappear). If you're worried about greyscale then IMHO you should use different line styles (solid/dashed/dotted/...) and/or use solid black for everything and label the lines directly. Which isn't to say that there's never any value in picking line colors from a colormap, it's just more complicated than it seems :-). -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ Matplotlib-devel mailing list Matplotlib-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/matplotlib-devel
Re: [matplotlib-devel] Better defaults all around?
On Wed, Nov 26, 2014 at 1:30 AM, Todd toddr...@gmail.com wrote: About this, I am not expert so forgive me if this is nonsensical. However, it would seem to me that these requirements are basically the same as the requirements for the new default colormap that prompted this whole discussion. So, rather than create two inconsistent set of colors that accomplish similar goals, might it be better to instead use the default colormap for the line colors? You could pick N equally-spaced colors from the colormap and use those as the line colors. I'm no expert either, but while similar principles about colorblind compatibility, etc apply, you want to sue a different scheme to represent a continuous range of colors and a set of distinct colors that aren't intended to be ranked. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/ORR(206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception chris.bar...@noaa.gov -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___ Matplotlib-devel mailing list Matplotlib-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/matplotlib-devel
Re: [matplotlib-devel] Better defaults all around?
On 26 Nov 2014, at 07:53 pm, Chris Barker chris.bar...@noaa.gov wrote: On Wed, Nov 26, 2014 at 1:30 AM, Todd toddr...@gmail.com wrote: About this, I am not expert so forgive me if this is nonsensical. However, it would seem to me that these requirements are basically the same as the requirements for the new default colormap that prompted this whole discussion. So, rather than create two inconsistent set of colors that accomplish similar goals, might it be better to instead use the default colormap for the line colors? You could pick N equally-spaced colors from the colormap and use those as the line colors. I'm no expert either, but while similar principles about colorblind compatibility, etc apply, you want to sue a different scheme to represent a continuous range of colors and a set of distinct colors that aren't intended to be ranked. I’ve also become throughly annoyed with the default colour cycle, especially with its glaring cyan-magenta contrast, and found it desirable to have an easier way to customise this either explicitly or by changing color_cycle. As there are already a couple of sequences existing in the available colourmaps that could be useful for different purposes or tastes, what’s lacking in particular in my view is an easier-to-use interface to draw colours from those maps; I think that’s along the lines of what Todd also has suggested further down in his mail. I’ve written a little utility I’m simply appending because it’s so short, which returns an array of colours of specified length that could be passed to axes.color_cycle or just explicitly used as crange[i]. Also useful to colour scatter plot markers according to a certain quantity (pass this quantity as “values” to crange). Regarding to the above, I think sometimes the line colour requirements are similar to those for a general colourmap, e.g. I often want to plot a series of lines like different spectra, which are easily enough distinguishable, but should IMO reflect a certain continuous trend like different temperatures - are ranked, IOW - and thus would be well represented by a sequence of values from “heat or “coolwarm. However there are still some additional requirements, as you’d generally want every colour to have enough contrast on a white or bright background canvas. In the example below I’ve added a “max_lum” keyword to darken whitish or yellow colours appropriately. This is probably not extremely sophisticated in terms of colour physiology, but if you have a suggestion if and where it could be added to matplotlib, I could go ahead and make a pull request (and try to find the time to add some tests and examples). Cheers, Derek def crange(cmap, values, max_lum=1, start=0, stop=255, vmin=None, vmax=None): Returns RGBA colour array of length values from colormap cmap cmap: valid matplotlib.cm colormap name or instance values: either int - number of colour values to return or array of values to be mapped on colormap range max_lum: restrict colours to maximum brightness (1=white) start,stop: range of colormap to use (full range 0-255) vmin,vmax: input values mapped to start/stop (default actual data limits) try: if np.isscalar(values): vrange = np.linspace(start,stop,np.int(values)) else: v = np.array(values).astype(np.float) vmin = vmin or v.min() vmax = vmax or v.max() vrange = start+(v-vmin)*(stop-start)/(vmax-vmin) except (ValueError, TypeError) as err: print(invalid input values: must be no. of colours or array: %s % err) return None vrange = np.uint8(np.round(vrange)) cmap = matplotlib.cm.get_cmap(cmap) lcor = (1.0-max_lum) / 9 crange = cmap(vrange) crange[:,:3] *= (1-crange[:,:3].sum(axis=1)**2*lcor).reshape(-1,1) return crange -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ Matplotlib-devel mailing list Matplotlib-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/matplotlib-devel
Re: [matplotlib-devel] Better defaults all around?
On Wed, Nov 26, 2014 at 9:30 AM, Todd toddr...@gmail.com wrote: On Sat, Nov 22, 2014 at 12:22 AM, Nathaniel Smith n...@pobox.com wrote: - Default line colors: The rgbcmyk color cycle for line plots doesn't appear to be based on any real theory about visualization -- it's just the corners of the RGB color cube, which is a highly perceptually non-uniform space. The resulting lines aren't terribly high contrast against the default white background, and the different colors have varying luminance that makes some lines pop out more than others. Seaborn's default is to use a nice isoluminant variant on matplotlib's default: http://web.stanford.edu/~mwaskom/software/seaborn/tutorial/aesthetics.html ggplot2 uses isoluminant colors with maximally-separated hues, which also works well. E.g.: http://www.cookbook-r.com/Graphs/Colors_%28ggplot2%29/ggplot2_scale_hue_colors_l45.png About this, I am not expert so forgive me if this is nonsensical. However, it would seem to me that these requirements are basically the same as the requirements for the new default colormap that prompted this whole discussion. So, rather than create two inconsistent set of colors that accomplish similar goals, might it be better to instead use the default colormap for the line colors? You could pick N equally-spaced colors from the colormap and use those as the line colors. The main differences in requirements are: - for the color cycle, you want isoluminant colors, to avoid the issue where one line is glaring bright red and one is barely-visible-grey. For general-purpose 2d colormaps, though, you almost always want the luminance to vary to help distinguish colors from each other. - for the color cycle, there's no problem with using widely separated hues -- in fact it's usually better b/c it increases contrast between the different items, and there's no need to communicate an ordering between them. But if you try to use the whole hue space in a colormap then you end up with the much-loathed jet. -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ Matplotlib-devel mailing list Matplotlib-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/matplotlib-devel
Re: [matplotlib-devel] Better defaults all around?
On 22 Nov 2014 02:22, Benjamin Root ben.r...@ou.edu wrote: Some of your wishes are in progress already: https://github.com/matplotlib/matplotlib/pull/3818 There is also an issue open about scaling the dashes with the line width, and you are right, the spacing for the dashes are terrible. Nice! I can definitely see the argument to making a bunch of these visual changes together. Preferably, I would like to do these changes via style sheets so that we can provide a classic stylesheet for backwards compatibility. Yeah, I didn't want to get into the details of mechanism here because that's a comparatively simple technical question, compared to the questions about whether we should make changes and which changes we should make. But I'm definitely assuming we'll provide a simple supported/documented way to request the old defaults, and I agree that the obvious way is by swapping out stylesheets. This might require adding a few more parameters to rcParam, but I'm guessing that won't be a big deal. I do actually like the autoscaling system as it exists now. The problem is that the data margins feature is applied haphazardly. The power spectra example is a good example of where we could smarten the system. Can you elaborate on what you like about it? Like I said, when I first heard about it sounded like a neat idea. But in practice, over my years of using matplotlib... sometimes it's been fine, and sometimes it's made me roll my eyes/swear, but I don't think there's been a single instance where I looked at a graph and thought oo, nice one matplotlib - your insistence on shrinking my data to use fewer pixels in order to get a major tick lined up exactly with the spines has really improved this graph. Neat tick/spine alignment really is the highest priority in data visualization. Even in the rare cases where my measurement scale actually does have a neat 0-1 or 0-100 range, I usually find that matplotlib has chosen something like 0-90, or, if we fix the issue with cramming data right up into the axes, then I guess I'll end up with -10 - 110. Which looks worse than something like -4 - 104, because with -4 - 104, my outermost axis labels are 0 and 100. With -10 - 110, the outermost labels are -10 and 110, and it's weird and confusing to have axis labels naming impossible values. So can you share your examples of where this behavior has given you substantively better results? As for the ticks... I think that is a very obscure edge-case. I personally prefer inward. Yeah, that one is a pet peeve - I was gratified to see that the seaborn folks also took the trouble to fix it (I'm not alone!). To be fair, though, the reason I noticed isn't that I care a lot about ticks per se, it's because the default was screwing up my figures so I had to go track it down :-/. Here's another example -- the final versions of the autocorrelation graphs I mentioned above. [image: Inline image 1] In both of these graphs, having the ticks to point inwards created weird confusing intersections with the lines, so I had to flip them to point outwards. It's just an objective thing, if you use the same pixels for data and metadata then that creates room for ugly stuff to happen. And when it comes to defaults, if you have two choices that are basically equivalent, except that one is always fine and one is usually fine but sometimes screws things up, then the former seems like the obvious choice... -n -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___ Matplotlib-devel mailing list Matplotlib-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/matplotlib-devel
Re: [matplotlib-devel] Better defaults all around?
On 2014/11/22, 9:06 AM, gary ruben wrote: A few thoughts to add to the excellent ones to date, to do with colorbar behaviour. My general comment would be that if the axis tick formatter defaults are changed not to forget about the colorbar as I typically find it needs more tweaking than the main axes. I'll make a couple of suggestions, but these are low on the list compared to the suggestions that others have made. 1. consider rasterizing colorbar contents by default 2. make colorbar axis sizing for matshow behave like imshow 1. consider rasterizing colorbar contents by default Eric describes this here http://matplotlib.1069221.n5.nabble.com/rasterized-colorbar-td39582.html and suggests that rasterizing the colorbar may not be desirable, although I'm not totally sure why. Perhaps it is because I have noticed that mixing rasterized content with vector lines/axes in matplotlib is generally imperfect. If saving the figure as a pdf or svg with dpi left at default, you can usually see offsets and scaling problems. For example after rasterizing a colorbar I usually see white pixels along the top and side within the vector colorbar frame. This also shows up when using imshow or matshow to show images. I don't know if this is an agg limitation, a backend limitation or a bug. If it's a known limitation, maybe avoid this suggestion, but if it's a bug, maybe it can be fixed and then rasterizing the colorbar might become a better default option. I think the problem is that the outlines are snapped to pixel boundaries, but the color blocks are not. Something like that. I think a similar problem is manifest in the small offsets often seen between colorbar ticks and colorbar boundaries. For colorbars I usually do lots of tweaking along the lines of: cb = plt.colorbar(format=ScalarFormatter(useMathText=True)) cb.formatter.set_useOffset(False) cb.formatter.set_scientific(True) cb.formatter.set_powerlimits((0,2)) cb.update_ticks() cb.solids.set_rasterized(True) although I'm not sure about advocating useMathText and set_scientific for defaults. I wonder what other think about this? I don't see why you would want the *default* to be to override the rcParams setting for use_mathtext. This just makes it harder to document, and harder for people to keep track of what determines what. To some extent this applies to the rest of your customizations as well. Deviations from the rcParams defaults via special cases, hardwired into mpl, should be avoided as much as possible. A richer configuration system, building on rcParams or some modification of it, will probably be the goal instead. The evolving style module is a step in this direction. Things like default powerlimits for the colorbar might be rethought. I think colorbars typically have too many ticks and associated labels and they should perhaps favour integer labels over floating point representation if possible. In the extreme case, for continuous colormaps, often a tick at just the top and bottom of the range would be adequate. I agree, but the question is how to make it as easy as possible for each user to get their desired result. I don't think this is the time to do much in the way of tweaking hard-wired defaults. 2. I'm not sure how much pyplot.matshow is generally used but I still use it. Could the colorbar height for matshow pick up the axis height of the main figure, or maybe imshow could default to interpolation='nearest' so I wouldn't be tempted to use matshow any more? For example, plt.matshow(rand(20,20)) plt.colorbar() doesn't behave nicely like plt.imshow(rand(20,20), interpolation='nearest') plt.colorbar() The difference is that matshow is adjusting the figure size based on the array dimensions without taking into account the later addition of a colorbar. The only way to fix this in our present framework would be to use a kwarg to tell matshow to include a colorbar from the start, so it would be able to calculate the figure size appropriately. With imshow plus a colorbar, the nice behavior occurs only for a particular small range of array dimension ratios, such as the unity ratio in your example. For example, try using rand(5, 10). Eric Gary -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ Matplotlib-devel mailing list Matplotlib-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/matplotlib-devel
Re: [matplotlib-devel] Better defaults all around?
I'd like to propose an update to the default boxplot symbology: all black Q: How much more black could the boxplots be? A: None. None more black. (sorry, ben) On Fri, Nov 21, 2014 at 7:18 PM, Benjamin Root ben.r...@ou.edu wrote: With regards to defaults for 2.0, I am actually all for breaking them for the better. What I find important is giving users an easy mechanism to use an older style, if it is important to them. The current behavior isn't buggy (for the most part) and failing to give users a way to get behavior that they found desirable would be alienating. I think this is why projects like prettyplotlib and seaborn have been so important to matplotlib. It enables those who are in the right position to judge styles to explore the possibilities easily without commiting matplotlib to any early decision and allowing it to have a level of stability that many users find attractive. At the moment, the plans for the OO interface changes should not result in any (major) API breaks, so I am not concerned about that at the moment. Let's keep focused on style related issues in this thread. Tabbed figures? Intriguing... And I really do need to review that MEP of yours... Cheers! Ben Root On Fri, Nov 21, 2014 at 9:36 PM, Federico Ariza ariza.feder...@gmail.com wrote: I like the idea of aligning a set of changes for 2.0 even if still far away. Regarding to backwards compatibility I think that indeed it is important but when changing mayor version (1.x to 2.0) becomes less important and we must take care of prioritizing evolution. Take for example the OO interface (not defined yet) this is very probable to break the current pyplot interface but still this is a change that needs to be done. In terms of defaults. I would like to see the new Navigation as default (if it gets merged) and tabbed figures (to come after navigation), having separate figures feel kind of ...old On 21 Nov 2014 21:23, Benjamin Root ben.r...@ou.edu wrote: Some of your wishes are in progress already: https://github.com/matplotlib/matplotlib/pull/3818 There is also an issue open about scaling the dashes with the line width, and you are right, the spacing for the dashes are terrible. I can definitely see the argument to making a bunch of these visual changes together. Preferably, I would like to do these changes via style sheets so that we can provide a classic stylesheet for backwards compatibility. I do actually like the autoscaling system as it exists now. The problem is that the data margins feature is applied haphazardly. The power spectra example is a good example of where we could smarten the system. As for the ticks... I think that is a very obscure edge-case. I personally prefer inward. It is good to get these grievances enumerated. I am interested in seeing where this discussion goes. Cheers! Ben Root On Fri, Nov 21, 2014 at 6:22 PM, Nathaniel Smith n...@pobox.com wrote: Hi all, Since we're considering the possibility of making a matplotlib 2.0 release with a better default colormap, it occurred to me that it might make sense to take this opportunity to improve other visual defaults. Defaults are important. Obviously for publication graphs you'll want to end up tweaking every detail, but (a) not everyone does but we still have to read their graphs, and (b) probably only 1% of the plots I make are for publication; the rest are quick one-offs that I make on-the-fly to help me understand my own data. For such plots it's usually not worth spending much/any time tweaking layout details, I just want something usable, quickly. And I think there's a fair amount of low-hanging improvements possible. Batching multiple visual changes like this together seems much better than spreading them out over multiple releases. It keeps the messaging super easy to understand: matplotlib 2.0 is just like 1.x, your code will still work, the only difference is that your plots will look better by default. And grouping these changes together makes it easier to provide for users who need to revert back to the old defaults -- it's easy to provide simple binary choice between before 2.0 versus after 2.0, harder to keep track of a bunch of different changes spread over multiple releases. Some particular annoyances I often run into and that might be candidates for changing: - The default method of choosing axis limits is IME really, really annoying, because of the way it tries to find round number boundaries. It's a clever idea, but in practice I've almost never seen this pick axis limits that are particularly meaningful for my data, and frequently it picks particularly bad ones. For example, suppose you want to plot the spectrum of a signal; because of FFT's preference for power-of-two sizes works it's natural to end up with samples ranging from 0 to 255. If you plot this, matplotlib will give you an xlim of (0, 300), which looks pretty ridiculous. But even worse
Re: [matplotlib-devel] Better defaults all around?
I would be also quite interested in having better defaults. My list of complains are: * Easy way to get only two lines for axis (left and down, instead of four) * Better default font (Source Sans Pro / Source Code Pro for example (open source)) * Better default colormap * Better axis limit (when you draw with thick lines, they get cut) * Better icons for the toolbar (there are a lot of free icons around) * Better colors (more pastel) * Less cluttered figures * Lighter grids + All Nathaniel's suggestions Ideally, we could have a set of standard figures for each main type (plot, scatter, quiver) and tweak parameters to search for the best output. Nicolas On 22 Nov 2014, at 04:18, Benjamin Root ben.r...@ou.edu wrote: With regards to defaults for 2.0, I am actually all for breaking them for the better. What I find important is giving users an easy mechanism to use an older style, if it is important to them. The current behavior isn't buggy (for the most part) and failing to give users a way to get behavior that they found desirable would be alienating. I think this is why projects like prettyplotlib and seaborn have been so important to matplotlib. It enables those who are in the right position to judge styles to explore the possibilities easily without commiting matplotlib to any early decision and allowing it to have a level of stability that many users find attractive. At the moment, the plans for the OO interface changes should not result in any (major) API breaks, so I am not concerned about that at the moment. Let's keep focused on style related issues in this thread. Tabbed figures? Intriguing... And I really do need to review that MEP of yours... Cheers! Ben Root On Fri, Nov 21, 2014 at 9:36 PM, Federico Ariza ariza.feder...@gmail.com wrote: I like the idea of aligning a set of changes for 2.0 even if still far away. Regarding to backwards compatibility I think that indeed it is important but when changing mayor version (1.x to 2.0) becomes less important and we must take care of prioritizing evolution. Take for example the OO interface (not defined yet) this is very probable to break the current pyplot interface but still this is a change that needs to be done. In terms of defaults. I would like to see the new Navigation as default (if it gets merged) and tabbed figures (to come after navigation), having separate figures feel kind of ...old On 21 Nov 2014 21:23, Benjamin Root ben.r...@ou.edu wrote: Some of your wishes are in progress already: https://github.com/matplotlib/matplotlib/pull/3818 There is also an issue open about scaling the dashes with the line width, and you are right, the spacing for the dashes are terrible. I can definitely see the argument to making a bunch of these visual changes together. Preferably, I would like to do these changes via style sheets so that we can provide a classic stylesheet for backwards compatibility. I do actually like the autoscaling system as it exists now. The problem is that the data margins feature is applied haphazardly. The power spectra example is a good example of where we could smarten the system. As for the ticks... I think that is a very obscure edge-case. I personally prefer inward. It is good to get these grievances enumerated. I am interested in seeing where this discussion goes. Cheers! Ben Root On Fri, Nov 21, 2014 at 6:22 PM, Nathaniel Smith n...@pobox.com wrote: Hi all, Since we're considering the possibility of making a matplotlib 2.0 release with a better default colormap, it occurred to me that it might make sense to take this opportunity to improve other visual defaults. Defaults are important. Obviously for publication graphs you'll want to end up tweaking every detail, but (a) not everyone does but we still have to read their graphs, and (b) probably only 1% of the plots I make are for publication; the rest are quick one-offs that I make on-the-fly to help me understand my own data. For such plots it's usually not worth spending much/any time tweaking layout details, I just want something usable, quickly. And I think there's a fair amount of low-hanging improvements possible. Batching multiple visual changes like this together seems much better than spreading them out over multiple releases. It keeps the messaging super easy to understand: matplotlib 2.0 is just like 1.x, your code will still work, the only difference is that your plots will look better by default. And grouping these changes together makes it easier to provide for users who need to revert back to the old defaults -- it's easy to provide simple binary choice between before 2.0 versus after 2.0, harder to keep track of a bunch of different changes spread over multiple releases. Some particular annoyances I often run into and that might be candidates for changing: - The default method of choosing axis limits is IME really,
Re: [matplotlib-devel] Better defaults all around?
Some of your wishes are in progress already: https://github.com/matplotlib/matplotlib/pull/3818 There is also an issue open about scaling the dashes with the line width, and you are right, the spacing for the dashes are terrible. I can definitely see the argument to making a bunch of these visual changes together. Preferably, I would like to do these changes via style sheets so that we can provide a classic stylesheet for backwards compatibility. I do actually like the autoscaling system as it exists now. The problem is that the data margins feature is applied haphazardly. The power spectra example is a good example of where we could smarten the system. As for the ticks... I think that is a very obscure edge-case. I personally prefer inward. It is good to get these grievances enumerated. I am interested in seeing where this discussion goes. Cheers! Ben Root On Fri, Nov 21, 2014 at 6:22 PM, Nathaniel Smith n...@pobox.com wrote: Hi all, Since we're considering the possibility of making a matplotlib 2.0 release with a better default colormap, it occurred to me that it might make sense to take this opportunity to improve other visual defaults. Defaults are important. Obviously for publication graphs you'll want to end up tweaking every detail, but (a) not everyone does but we still have to read their graphs, and (b) probably only 1% of the plots I make are for publication; the rest are quick one-offs that I make on-the-fly to help me understand my own data. For such plots it's usually not worth spending much/any time tweaking layout details, I just want something usable, quickly. And I think there's a fair amount of low-hanging improvements possible. Batching multiple visual changes like this together seems much better than spreading them out over multiple releases. It keeps the messaging super easy to understand: matplotlib 2.0 is just like 1.x, your code will still work, the only difference is that your plots will look better by default. And grouping these changes together makes it easier to provide for users who need to revert back to the old defaults -- it's easy to provide simple binary choice between before 2.0 versus after 2.0, harder to keep track of a bunch of different changes spread over multiple releases. Some particular annoyances I often run into and that might be candidates for changing: - The default method of choosing axis limits is IME really, really annoying, because of the way it tries to find round number boundaries. It's a clever idea, but in practice I've almost never seen this pick axis limits that are particularly meaningful for my data, and frequently it picks particularly bad ones. For example, suppose you want to plot the spectrum of a signal; because of FFT's preference for power-of-two sizes works it's natural to end up with samples ranging from 0 to 255. If you plot this, matplotlib will give you an xlim of (0, 300), which looks pretty ridiculous. But even worse is the way this method of choosing xlims can actually obscure data -- if the extreme values in your data set happen to fall exactly on a round number, then this will be used as the axis limits, and you'll end up with data plotted directly underneath the axis spine. I frequently encounter this when making scatter plots of data in the 0-1 range -- the points located at exactly 0 and 1 are very important to see, but are nearly invisible by default. A similar case I ran into recently was when plotting autocorrelation functions for different signals. For reference I wanted to include the theoretically ideal ACF for white noise, which looks like this: plt.plot(np.arange(1000), [1] + [0] * 999) Good luck reading that plot! R's default rule for deciding axis limits is very simple: extend the data range by 4% on each side; those are your limits. IME this rule -- while obviously not perfect -- always produces something readable and unobjectionable. - Axis tickmarks should point outwards rather than inwards: There's really no advantage to making them point inwards, and pointing inwards means they can obscure data. My favorite example of this is plotting a histogram with 100 bins -- that's an obvious thing to do, right? Check it out: plt.hist(np.random.RandomState(0).uniform(size=10), bins=100) This makes me do a double-take every few months until I remember what's going on: WTF why is the bar on the left showing a *stacked* barplot...oh right those are just the ticks, which happen to be exactly the same width as the bar. Very confusing. Seaborn's built-in themes give you the options of (1) no axis ticks at all, just a background grid (by default the white-on-light-grey grid as popularized by ggplot2), (2) outwards pointing tickmarks. Either option seems like a better default to me! - Default line colors: The rgbcmyk color cycle for line plots doesn't appear to be based on any real theory about visualization -- it's just the corners of the
Re: [matplotlib-devel] Better defaults all around?
I like the idea of aligning a set of changes for 2.0 even if still far away. Regarding to backwards compatibility I think that indeed it is important but when changing mayor version (1.x to 2.0) becomes less important and we must take care of prioritizing evolution. Take for example the OO interface (not defined yet) this is very probable to break the current pyplot interface but still this is a change that needs to be done. In terms of defaults. I would like to see the new Navigation as default (if it gets merged) and tabbed figures (to come after navigation), having separate figures feel kind of ...old On 21 Nov 2014 21:23, Benjamin Root ben.r...@ou.edu wrote: Some of your wishes are in progress already: https://github.com/matplotlib/matplotlib/pull/3818 There is also an issue open about scaling the dashes with the line width, and you are right, the spacing for the dashes are terrible. I can definitely see the argument to making a bunch of these visual changes together. Preferably, I would like to do these changes via style sheets so that we can provide a classic stylesheet for backwards compatibility. I do actually like the autoscaling system as it exists now. The problem is that the data margins feature is applied haphazardly. The power spectra example is a good example of where we could smarten the system. As for the ticks... I think that is a very obscure edge-case. I personally prefer inward. It is good to get these grievances enumerated. I am interested in seeing where this discussion goes. Cheers! Ben Root On Fri, Nov 21, 2014 at 6:22 PM, Nathaniel Smith n...@pobox.com wrote: Hi all, Since we're considering the possibility of making a matplotlib 2.0 release with a better default colormap, it occurred to me that it might make sense to take this opportunity to improve other visual defaults. Defaults are important. Obviously for publication graphs you'll want to end up tweaking every detail, but (a) not everyone does but we still have to read their graphs, and (b) probably only 1% of the plots I make are for publication; the rest are quick one-offs that I make on-the-fly to help me understand my own data. For such plots it's usually not worth spending much/any time tweaking layout details, I just want something usable, quickly. And I think there's a fair amount of low-hanging improvements possible. Batching multiple visual changes like this together seems much better than spreading them out over multiple releases. It keeps the messaging super easy to understand: matplotlib 2.0 is just like 1.x, your code will still work, the only difference is that your plots will look better by default. And grouping these changes together makes it easier to provide for users who need to revert back to the old defaults -- it's easy to provide simple binary choice between before 2.0 versus after 2.0, harder to keep track of a bunch of different changes spread over multiple releases. Some particular annoyances I often run into and that might be candidates for changing: - The default method of choosing axis limits is IME really, really annoying, because of the way it tries to find round number boundaries. It's a clever idea, but in practice I've almost never seen this pick axis limits that are particularly meaningful for my data, and frequently it picks particularly bad ones. For example, suppose you want to plot the spectrum of a signal; because of FFT's preference for power-of-two sizes works it's natural to end up with samples ranging from 0 to 255. If you plot this, matplotlib will give you an xlim of (0, 300), which looks pretty ridiculous. But even worse is the way this method of choosing xlims can actually obscure data -- if the extreme values in your data set happen to fall exactly on a round number, then this will be used as the axis limits, and you'll end up with data plotted directly underneath the axis spine. I frequently encounter this when making scatter plots of data in the 0-1 range -- the points located at exactly 0 and 1 are very important to see, but are nearly invisible by default. A similar case I ran into recently was when plotting autocorrelation functions for different signals. For reference I wanted to include the theoretically ideal ACF for white noise, which looks like this: plt.plot(np.arange(1000), [1] + [0] * 999) Good luck reading that plot! R's default rule for deciding axis limits is very simple: extend the data range by 4% on each side; those are your limits. IME this rule -- while obviously not perfect -- always produces something readable and unobjectionable. - Axis tickmarks should point outwards rather than inwards: There's really no advantage to making them point inwards, and pointing inwards means they can obscure data. My favorite example of this is plotting a histogram with 100 bins -- that's an obvious thing to do, right? Check it out:
Re: [matplotlib-devel] Better defaults all around?
With regards to defaults for 2.0, I am actually all for breaking them for the better. What I find important is giving users an easy mechanism to use an older style, if it is important to them. The current behavior isn't buggy (for the most part) and failing to give users a way to get behavior that they found desirable would be alienating. I think this is why projects like prettyplotlib and seaborn have been so important to matplotlib. It enables those who are in the right position to judge styles to explore the possibilities easily without commiting matplotlib to any early decision and allowing it to have a level of stability that many users find attractive. At the moment, the plans for the OO interface changes should not result in any (major) API breaks, so I am not concerned about that at the moment. Let's keep focused on style related issues in this thread. Tabbed figures? Intriguing... And I really do need to review that MEP of yours... Cheers! Ben Root On Fri, Nov 21, 2014 at 9:36 PM, Federico Ariza ariza.feder...@gmail.com wrote: I like the idea of aligning a set of changes for 2.0 even if still far away. Regarding to backwards compatibility I think that indeed it is important but when changing mayor version (1.x to 2.0) becomes less important and we must take care of prioritizing evolution. Take for example the OO interface (not defined yet) this is very probable to break the current pyplot interface but still this is a change that needs to be done. In terms of defaults. I would like to see the new Navigation as default (if it gets merged) and tabbed figures (to come after navigation), having separate figures feel kind of ...old On 21 Nov 2014 21:23, Benjamin Root ben.r...@ou.edu wrote: Some of your wishes are in progress already: https://github.com/matplotlib/matplotlib/pull/3818 There is also an issue open about scaling the dashes with the line width, and you are right, the spacing for the dashes are terrible. I can definitely see the argument to making a bunch of these visual changes together. Preferably, I would like to do these changes via style sheets so that we can provide a classic stylesheet for backwards compatibility. I do actually like the autoscaling system as it exists now. The problem is that the data margins feature is applied haphazardly. The power spectra example is a good example of where we could smarten the system. As for the ticks... I think that is a very obscure edge-case. I personally prefer inward. It is good to get these grievances enumerated. I am interested in seeing where this discussion goes. Cheers! Ben Root On Fri, Nov 21, 2014 at 6:22 PM, Nathaniel Smith n...@pobox.com wrote: Hi all, Since we're considering the possibility of making a matplotlib 2.0 release with a better default colormap, it occurred to me that it might make sense to take this opportunity to improve other visual defaults. Defaults are important. Obviously for publication graphs you'll want to end up tweaking every detail, but (a) not everyone does but we still have to read their graphs, and (b) probably only 1% of the plots I make are for publication; the rest are quick one-offs that I make on-the-fly to help me understand my own data. For such plots it's usually not worth spending much/any time tweaking layout details, I just want something usable, quickly. And I think there's a fair amount of low-hanging improvements possible. Batching multiple visual changes like this together seems much better than spreading them out over multiple releases. It keeps the messaging super easy to understand: matplotlib 2.0 is just like 1.x, your code will still work, the only difference is that your plots will look better by default. And grouping these changes together makes it easier to provide for users who need to revert back to the old defaults -- it's easy to provide simple binary choice between before 2.0 versus after 2.0, harder to keep track of a bunch of different changes spread over multiple releases. Some particular annoyances I often run into and that might be candidates for changing: - The default method of choosing axis limits is IME really, really annoying, because of the way it tries to find round number boundaries. It's a clever idea, but in practice I've almost never seen this pick axis limits that are particularly meaningful for my data, and frequently it picks particularly bad ones. For example, suppose you want to plot the spectrum of a signal; because of FFT's preference for power-of-two sizes works it's natural to end up with samples ranging from 0 to 255. If you plot this, matplotlib will give you an xlim of (0, 300), which looks pretty ridiculous. But even worse is the way this method of choosing xlims can actually obscure data -- if the extreme values in your data set happen to fall exactly on a round number, then this will be used as the axis limits, and you'll end up with data plotted directly