McClean: Plotting Stacked Histograms: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
Line 14: Line 14:


Chose bins (you probably want to use the same bin for every plot, since you will be stacking them along the same y-axis) and then bin your data using the Matlab "hist" command.  We also keep track of the distributions' means since we use this to color the histograms later.
Chose bins (you probably want to use the same bin for every plot, since you will be stacking them along the same y-axis) and then bin your data using the Matlab "hist" command.  We also keep track of the distributions' means since we use this to color the histograms later.
<pre>
<pre>
%Set up bins (we are making histograms of flow cytometry data so we chose logarithmically spaced bins):
%Set up bins (we are making histograms of flow cytometry data so we chose logarithmically spaced bins):
Line 31: Line 32:
</pre>
</pre>


[[Image:Fig1_BarChart.png]]
We set up a colormap so that our histograms change in color as the mean of their distribution increases:
 


This looks ok, but we would really like some error bars, so we use a handy function (barwitherr) from the Matlab file exchange:
<pre>
<pre>
h=figure; hold;
%% Define a colormap for the histograms that will make the histograms brighter as the mean of the distribution increases
barwitherr([Strain1_std' Strain2_std'], [1 2 3 4],[Strain1_Mean' Strain2_Mean'])
 
legend('Strain 1','Strain 2')
% In this case we chose to make the histograms brighter green at higher
pause; close all;
% mean values since the flow cytometry data is of GFP.
%Define a color map
MMColorMap=zeros(5,3);
 
%Define colors so that they scale with the difference between the mean
%fluorescence at a given timepoint and the mean at time 0
 
 
MM=sort(Means);
MMdiff=Means-Means(1);
MMdiff=MMdiff./(max(MMdiff));
 
 
MMColorMap(1:end,2)=MMdiff;
 
%Set up the figure and axis properties:
h=figure; hold; colors=colormap;
 
set(gca,'XScale','log')
set(gca,'XLim',[10,2000])
set(gca,'PlotBoxAspectRatioMode','manual')
set(gca,'PlotBoxAspectRatio',[1 3 1])
set(gca,'FontSize',12)
set(gca,'XTick',[100 1000 10000 100000])
set(gca,'YTick',[0 1])
ylabel('Fraction of Cell Population','FontSize',14)
xlabel('Fluorescence [a.u.]','FontSize',14)
</pre>
</pre>


[[Image:Fig2_BarChart.png]]


This is ok, but we'd rather only have one-sided error bars.  To do this, use a 4x2x2 matrix for the errors: cat(3,zeros(4,2),[Strain1_std' Strain2_std'])


The function barwitherr uses the first matrix zeros(4,2) as the lower error, and the second matrix [Strain1_std' Strain2_std'] for the upper errors:
Plot the histograms along the y-axis.  We choose the spacing variable empirically so that the plot "looks good":
<pre>
 
barwitherr(cat(3,zeros(4,2),[Strain1_std' Strain2_std']), [1 2 3 4],[Strain1_Mean' Strain2_Mean'])
legend('Strain 1','Strain 2')
pause; close all;
</pre>
[[Image:Fig3_BarChartEx.png‎]]


Don't like the colors? You can change them by modifying the colormap:
<pre>
<pre>
barmap=[0.7 0.7 0.7; 0.05 .45 0.1]; %[0.7 0.7 0.7] is grey, [ 0.05 .45 0.1] is green
spacing=.15; %Spacing along the y-axis chosen empirically
colormap(barmap);
ylabel('Data','FontSize',14)
title('Title of Experiment','FontSize',14)
pause;
</pre>


[[Image:Fig4_BarChartEx.png‎]]
for i=1:5
    fill([x(1);x'; x'],[i*spacing; (HistData(i,:)+i*spacing)'; ones(1,length(x))'*i*spacing],MMColorMap(i,:),'LineStyle','none')
    semilogx(x,HistData(i,:)+i*spacing,'LineWidth',3,'Color','k');
end


It isn't very useful to have our experimental conditions labelled 1,2,3,4.  To change the x-ticks to labels:
<pre>
set(ax, 'XTick',[1 2 3 4],'XTickLabel',{'A','B','C','D' });
pause;
</pre>
</pre>


[[Image:Fig5_BarChartEx.png‎]]
[[Image:Fig4_BarChartEx.png‎]]


Maybe we would like more information in our x-tick labels.  But if the labels are too long, they will overlap.  To get around this, rotate the x-tick labels using the function xticklabel_rotate from the Matlab file exchange (see references below):
Save your figure in a variety of formats for later use (recall that we made h our figure handle):
<pre>
<pre>
set(ax, 'FontSize',12,'XTick',[1 2 3 4],'XTickLabel',{'Condition A','Condition B','Condition C','Condition D' });
saveas(h,'ExampleStackedHistograms','fig')
xticklabel_rotate([1 2 3 4],45,{'Condition A','Condition B','Condition C','Condition D' })
saveas(h,'ExampleStackedHistograms','png')
pause
saveas(h,'ExampleStackedHistograms','ai')
saveas(h,'ExampleStackedHistograms','pdf')
</pre>
</pre>
[[Image:Fig6_BarChartEx.png‎]]


 
[[Image:Fig4_BarChartEx.png‎]]
If you are going to use this figure in a presentation or paper you can save it in various forms (including as a file for adobe illustrator).  Recall that h is our figure handle:
<pre>
saveas(h, 'ExampleBar.fig','fig')
saveas(h, 'ExampleBar.png','png')
saveas(h, 'ExampleBar.ai','ai')
close all;
</pre>


=Code=
=Code=

Revision as of 15:14, 17 July 2013

Summary

This explains the basics of plotting histograms stacked vertically (this allows you to see the shift, in for instance, fluorescence in a population of cells analyzed by flow cytometry).

Example

Your data could be anything. In this example, the variable "Data" contains five (5) rows, each of which contain 9000 fluorescence readings from a FACS experiment. Each row represents a timepoint, with induction of GFP increasing with time.


%% Preliminaries:

close all; clear all;
load('Data.mat')

Chose bins (you probably want to use the same bin for every plot, since you will be stacking them along the same y-axis) and then bin your data using the Matlab "hist" command. We also keep track of the distributions' means since we use this to color the histograms later.

%Set up bins (we are making histograms of flow cytometry data so we chose logarithmically spaced bins):
bins=logspace(0,4,60);
x=bins;

%Bin the data using "hist" and keep track of the number of elements "n" in each bin "x" for each row in "Data".  Also keep track of the mean of each row of "Data":

HistData=[];  
Means=[];

for i=1:5
    [n,x]=hist(Data(i,:),x);
    HistData=[HistData; n./sum(n)];
    Means=[Means mean(Data(i,:))];
end

We set up a colormap so that our histograms change in color as the mean of their distribution increases:


%% Define a colormap for the histograms that will make the histograms brighter as the mean of the distribution increases

% In this case we chose to make the histograms brighter green at higher
% mean values since the flow cytometry data is of GFP.
 
%Define a color map
MMColorMap=zeros(5,3);

%Define colors so that they scale with the difference between the mean
%fluorescence at a given timepoint and the mean at time 0


MM=sort(Means);
MMdiff=Means-Means(1);
MMdiff=MMdiff./(max(MMdiff));


MMColorMap(1:end,2)=MMdiff;

%Set up the figure and axis properties:
h=figure; hold; colors=colormap;

set(gca,'XScale','log')
set(gca,'XLim',[10,2000])
set(gca,'PlotBoxAspectRatioMode','manual')
set(gca,'PlotBoxAspectRatio',[1 3 1])
set(gca,'FontSize',12)
set(gca,'XTick',[100 1000 10000 100000])
set(gca,'YTick',[0 1])
ylabel('Fraction of Cell Population','FontSize',14)
xlabel('Fluorescence [a.u.]','FontSize',14)


Plot the histograms along the y-axis. We choose the spacing variable empirically so that the plot "looks good":


spacing=.15;  %Spacing along the y-axis chosen empirically 

for i=1:5
    fill([x(1);x'; x'],[i*spacing; (HistData(i,:)+i*spacing)'; ones(1,length(x))'*i*spacing],MMColorMap(i,:),'LineStyle','none')
    semilogx(x,HistData(i,:)+i*spacing,'LineWidth',3,'Color','k');
end

Save your figure in a variety of formats for later use (recall that we made h our figure handle):

saveas(h,'ExampleStackedHistograms','fig')
saveas(h,'ExampleStackedHistograms','png')
saveas(h,'ExampleStackedHistograms','ai')
saveas(h,'ExampleStackedHistograms','pdf')
 

Code

You can copy and paste the code below into a Matlab m-file to run all of the examples shown above. You will also need the two functions listed in the references below, available from the Matlab file exchange at Matlab Central.

close all;

%Suppose you have the following data for two different strains across 4
%different experimental conditions (Conditions A,B,C,D, from left to right)
Strain1_Mean=[0.5137    3.2830    1.5887    5.9188];
Strain2_Mean=[0.4042    2.9884    0.5709    2.7766];
Strain1_std=[1.1393    2.8108    2.2203    3.5233];
Strain2_std=[0.8762    2.8478    0.9878    2.2197];


%Plot this data as a bar chart
bar([1 2 3 4],[Strain1_Mean' Strain2_Mean'])
legend('Strain 1','Strain 2')
pause; close all;

%This looks ok, but we would really like some error bars, so we use a handy
%function from the file exchange:
h=figure; hold;
barwitherr([Strain1_std' Strain2_std'], [1 2 3 4],[Strain1_Mean' Strain2_Mean'])
legend('Strain 1','Strain 2')
pause; close all;

%This is ok, but we'd rather only have one-sided error bars.  To do this,
%you will send barwitherr zeros for the lower error and keep the upper
%error as is by sending in the matrix cat(3,zeros(4,2),[Strain1_std'
%Strain2_std']) for the error
barwitherr(cat(3,zeros(4,2),[Strain1_std' Strain2_std']), [1 2 3 4],[Strain1_Mean' Strain2_Mean'])
legend('Strain 1','Strain 2')
pause; close all;

%Now let's use better colors by changing the color map and set the bar
%widths, line widths, axis fonts etc to something prettier
barwitherr(cat(3,zeros(4,2),[Strain1_std' Strain2_std']), [1 2 3 4],[Strain1_Mean' Strain2_Mean'],'LineWidth',2,'BarWidth',0.9)
legend('Strain 1','Strain 2')
%set the axis properties
ax=gca;
set(ax, 'FontSize',12)


%Don't like the colors? You can change them by modifying the colormap:
barmap=[0.7 0.7 0.7; 0.05 .45 0.1]; %[0.7 0.7 0.7] is grey, [ 0.05 .45 0.1] is a green
colormap(barmap);
ylabel('Data','FontSize',14)
title('Title of Experiment','FontSize',14)
pause; 

%It isn't very useful to have our experimental conditions labelled 1,2,3,4
%so can we change these to words? Yes:
set(ax, 'XTick',[1 2 3 4],'XTickLabel',{'A','B','C','D' });
pause;
%But this isn't perfect, maybe we want more information on the axis.  To
%have actual labels rotate them using the handy xticklabel_rotate function:
%set(ax, 'FontSize',12,'XTick',[1 2 3 4],'XTickLabel',{'Condition A','Condition B','Condition C','Condition D' });
xticklabel_rotate([1 2 3 4],45,{'Condition A','Condition B','Condition C','Condition D' })
pause

%If you are going to use this figure in a presentation or paper you can
%save it in various forms (including as a file for adobe illustrator):

%Recall that h is our figure handle:
 saveas(h, 'ExampleBar.fig','fig')
 saveas(h, 'ExampleBar.png','png')
 saveas(h, 'ExampleBar.ai','ai')
 
 close all;

Notes

Please feel free to post comments, questions, or improvements to this protocol. Happy to have your input!

  • Megan N McClean 17:27, 11 June 2012 (EDT): There are probably more elegant ways of doing this, but this solution has worked well for me so far. Please feel free to update and add information as you figure out better ways of doing this.

References

Function xticklabel_rotate: xticklabel_rotate

Function barwitherr: barwitherr

Contact

or instead, discuss this protocol.