5 Steps To Improve Your Flow Cytometry Data Analysis

With the continued emphasis on improving the reproducibility of scientific data, it is critical to remember that no single step will solve this problem. Instead, it is a mindset that needs to be adopted. From adding descriptors to the data to ensuring that the proper information is extracted, these steps, especially when communicated, can improve the robustness of your data. Further, as more and more automated analytical tools are being developed, using

Improving the quality of your data starts with how you approach your data analysis process, your experimental design and what happens when you sit down at the instrument.

1. Add keywords at the beginning of your experimental setup.

The Flow Cytometry Standard (FCS) file is composed of two pieces: the listmode data, a spreadsheet of the values for each cell, and a header file, which is where the keywords are stored. Many of these keywords are defined by the FCS standard and are automatically added to the file by the instrument. When troubleshooting an experiment, this is a good place to look at how the system was set up for the acquisition.
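To make the file layout concrete, here is a minimal sketch of reading the keywords from an FCS file in Python. It assumes the FCS 3.x layout, in which a fixed-width HEADER gives the byte offsets of a TEXT segment, and the TEXT segment holds the keyword/value pairs separated by a delimiter character. The function name and the demo file are hypothetical, and the sketch deliberately ignores escaped (doubled) delimiters and the case where offsets are stored in the TEXT segment instead, so treat it as an illustration rather than a full parser.

```python
def read_fcs_keywords(raw: bytes) -> dict[str, str]:
    """Return the keyword/value pairs from the TEXT segment of an FCS file.

    Simplified sketch: does not handle doubled (escaped) delimiters.
    """
    if raw[0:3] != b"FCS":
        raise ValueError("not an FCS file")
    # HEADER: bytes 10-17 and 18-25 hold the offsets of the first and
    # last byte of the TEXT segment, as right-justified ASCII integers.
    text_start = int(raw[10:18].decode("ascii").strip())
    text_end = int(raw[18:26].decode("ascii").strip())
    text = raw[text_start:text_end + 1].decode("latin-1")
    # The first character of TEXT is the delimiter used for the whole segment.
    delimiter = text[0]
    fields = text[1:].split(delimiter)
    if fields and fields[-1] == "":
        fields.pop()  # drop the empty string after the trailing delimiter
    # Fields alternate keyword, value, keyword, value, ...
    return dict(zip(fields[0::2], fields[1::2]))


def build_demo_file() -> bytes:
    """Build a tiny, hypothetical FCS-like byte string (no event data)."""
    text = "/$TOT/10000/$P1N/FSC-A/SAMPLE ID/T0/"
    header = (
        "FCS3.1" + " " * 4
        + f"{58:>8}" + f"{58 + len(text) - 1:>8}"  # TEXT start and end
        + f"{0:>8}" * 4                            # DATA and ANALYSIS offsets
    )
    return (header + text).encode("ascii")


keywords = read_fcs_keywords(build_demo_file())
for name, value in keywords.items():
    print(f"{name} = {value}")
```

For real files, pass the bytes read from disk (for example `open(path, "rb").read()`) instead of the demo bytes. A maintained FCS library is the safer choice for production work.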

In addition to these defined terms, you are able to add keywords to your flow cytometry data when setting up your experiment. By doing this, it is possible to add information about the sample, the stains, the treatment conditions, etc. that ultimately makes it easier to search and organize your files during analysis.

It is important when adding keywords, especially when performing a longitudinal study, that the terms are entered consistently. Depending on the downstream applications and how you parse the keywords, CD3 is different from Cd3, and T0 is not the same as T=0. So make sure you are consistent in this process.
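One way to catch this kind of drift is to scan the values entered across an experiment for near-duplicates. The sketch below is an assumption about how you might do that, not a feature of any particular analysis package: it groups values that differ only by case, spaces, or punctuation, so CD3/Cd3 and T0/T=0 are flagged for review. The function name and sample values are hypothetical.

```python
from collections import defaultdict


def find_keyword_variants(values: list[str]) -> dict[str, set[str]]:
    """Group values that differ only by case, spaces, or punctuation.

    Returns only the groups with more than one spelling, keyed by the
    canonical form (uppercase, letters and digits only).
    """
    groups: dict[str, set[str]] = defaultdict(set)
    for value in values:
        canonical = "".join(ch for ch in value.upper() if ch.isalnum())
        groups[canonical].add(value)
    return {key: spellings for key, spellings in groups.items() if len(spellings) > 1}


# Hypothetical keyword values collected from several files in one study.
entered = ["CD3", "Cd3", "T0", "T=0", "CD4"]
for canonical, spellings in find_keyword_variants(entered).items():
    print(f"{canonical}: inconsistent spellings {sorted(spellings)}")
```

Running a check like this before analysis is cheap, and it is much easier to fix a keyword at acquisition than to reconcile spellings months into a longitudinal study.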

If you are using FCS Express, you can access the keywords under the Data Tab. This brings up a window that also allows you to select all keywords. Additionally, you can add keywords here if needed. Another nice feature is that if you have information stored in the header file that needs to be removed for privacy reasons, there is an option to anonymize the header information. This is a great way to protect files with PHI or other sensitive information.

Figure 1: FCS Header as displayed in FCS Express 6.
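If you handle keywords in your own scripts, the same idea can be sketched in a few lines of Python. This is not how FCS Express implements its anonymize option; it is a hypothetical illustration that works on an in-memory dictionary of keywords and does not rewrite the FCS file itself. The list of sensitive keyword names is an assumption and should be replaced with whatever your institution considers identifying.

```python
# Hypothetical list of keyword names to strip; adjust to local policy.
SENSITIVE_KEYWORDS = {"PATIENT ID", "PATIENT NAME", "DATE OF BIRTH", "$OP"}


def anonymize_keywords(
    keywords: dict[str, str],
    sensitive: set[str] = SENSITIVE_KEYWORDS,
) -> dict[str, str]:
    """Return a copy of the keywords with sensitive entries removed.

    Matching is case-insensitive; the input dictionary is not modified.
    """
    blocked = {name.upper() for name in sensitive}
    return {
        name: value
        for name, value in keywords.items()
        if name.upper() not in blocked
    }


# Hypothetical keyword set for one sample.
sample = {"$TOT": "10000", "Patient ID": "12345", "TIMEPOINT": "T0"}
print(anonymize_keywords(sample))
```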

Developing your naming convention during the experimental design phase is a good practice. This way it becomes ingrained into the workflow and template design.

2. Develop a quality control program.

Quality control is the name of the game when it comes to improving the quality and consistency of any data generated. Instrument quality control is typically performed following the manufacturer's (OEM) recommendations and is implemented by those in charge of the instruments. It’s a good idea to ask them about this QC and whether you can review it from time to time. The last thing you want is for your big find to turn out to be an instrument issue rather than a biological effect. QC helps prevent this by ensuring the instrument is running the same way on a daily/weekly/monthly basis.

However, that is not where it should end. Each researcher should invest a bit of time to add experimental quality control to their experiments. This QC helps ensure that when you sit down at the machine at 20:00, after a long day of preparing your sample, everything is still working. This can easily be achieved by using a bead and establishing target values for this bead based on the experiment. These values would be determined during the optimization phase of experimental development. Figure 2 shows what such a template might look like.

Figure 2: Experimental QC template.

This template indicates the name, lot and expiration date for the QC beads. On each of the plots is a target value to reach when setting up the experiment.

Advantages of this method are that it provides the researcher with a separate QC that can be tracked and used to show consistency in their data acquisition process. It will also reveal if there are issues with the instrument before the actual samples are put on the system.
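To illustrate how such target values could be tracked outside the acquisition software, here is a minimal sketch in Python. The channel names, target values, and the 10% tolerance are all assumptions made for the example; your own targets and acceptable window come from the optimization phase of your experiment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BeadTarget:
    channel: str
    target_mfi: float        # target bead intensity set during optimization
    tolerance: float = 0.10  # assumed acceptable deviation (10%)


def check_bead_qc(
    targets: list[BeadTarget],
    measured: dict[str, float],
) -> dict[str, bool]:
    """Return pass/fail per channel; a missing measurement counts as a fail."""
    results = {}
    for target in targets:
        value = measured.get(target.channel)
        if value is None:
            results[target.channel] = False
            continue
        low = target.target_mfi * (1 - target.tolerance)
        high = target.target_mfi * (1 + target.tolerance)
        results[target.channel] = low <= value <= high
    return results


# Hypothetical targets and today's bead readings.
targets = [BeadTarget("FITC", 50_000), BeadTarget("PE", 80_000)]
today = {"FITC": 52_000, "PE": 60_000}
for channel, passed in check_bead_qc(targets, today).items():
    print(f"{channel}: {'OK' if passed else 'OUT OF RANGE'}")
```

Logging these results with the bead lot and date for every run gives you a record you can plot over time to show that acquisition stayed consistent.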

A second recommendation for a QC program is to introduce a reference control into the experimental workflow. This is a sample whose behavior when stained is already known, and it helps to ensure that the staining process worked. An added advantage of the reference control is that it is an ideal sample for training others on the protocol. If they can’t get the expected results, then back to pipet school for them!

3. Trust, but verify.

As the Russian proverb tells us, Доверяй, но проверяй (trust, but verify). Many software packages have automatic settings for compensation. These settings should not be used without verifying the impact that they have on compensation. There are three that are particularly important to check.

  1. The number of events to collect – Compensation is based on a statistical measure of the central tendency of the negative and positive populations. The better this measure, the better the determination of compensation. This raises the question: how many events are enough? As was discussed in a previous post on this site, when using a bead-based carrier, make sure to collect at least 10,000 events. For cells as a carrier, collect at least 30,000. If you are using cells and the target is a rarer event, collecting more events is necessary. This is one reason that beads are a useful control.
  2. Avoid the universal negative – Many software packages drive the compensation analysis down a path where a single tube is used to set the negative populations for compensation. This ultimately violates the second rule of compensation, which requires the positive and negative carriers to have the same autofluorescence. The best practice is to have both a negative and a positive control in each compensation tube so that, when compensating, there is no ambiguity or concern over whether the second rule was violated.

     Figure 3: Issues with using a ‘Universal’ negative in compensation.

  3. Check the automatic gating – In some software, compensation is automated: the software identifies the target particles and gates the positive and negative populations. This can work very well if the samples are clean and well separated. However, when using cells, or with samples that contain debris, this gating can fail. An example of this is shown in Figure 4.

     Figure 4: When automatic compensation goes wrong. The debris in the compensation control tube seems to cause the algorithm some confusion (left), and the software generates an oddly shaped gate that includes this debris and only part of the correct sample. Contrast that to the sample on the right, which has a similar pattern, but where the software clearly captures the correct population.

While using automatic compensation is the correct way to compensate the experiment, make sure that these glitches are corrected before compensation is calculated.
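The first two checks can be sketched in a few lines of Python. The event minimums are the ones given above. The spillover calculation is an assumption about the measure of central tendency: it uses the median of each population, with the negative and positive populations taken from the same carrier, and is meant as an illustration rather than the algorithm any specific software uses. Function names and the sample numbers are hypothetical.

```python
from statistics import median

# Minimum events per compensation control, from the guidance above.
MIN_EVENTS = {"beads": 10_000, "cells": 30_000}


def enough_events(n_events: int, carrier: str) -> bool:
    """Check a compensation control against the minimum for its carrier."""
    return n_events >= MIN_EVENTS[carrier]


def spillover_coefficient(
    pos_primary: list[float],
    neg_primary: list[float],
    pos_secondary: list[float],
    neg_secondary: list[float],
) -> float:
    """Fraction of the primary-detector signal that appears in a secondary detector.

    Both the positive and negative populations should come from the same
    carrier so that their autofluorescence matches.
    """
    primary_shift = median(pos_primary) - median(neg_primary)
    if primary_shift <= 0:
        raise ValueError("positive population is not brighter than the negative")
    secondary_shift = median(pos_secondary) - median(neg_secondary)
    return secondary_shift / primary_shift


# Hypothetical intensities for one single-stained control.
coefficient = spillover_coefficient(
    pos_primary=[900, 1000, 1100],
    neg_primary=[90, 100, 110],
    pos_secondary=[190, 200, 210],
    neg_secondary=[20, 20, 20],
)
print(f"spillover: {coefficient:.2f}")
print("enough bead events:", enough_events(12_000, "beads"))
```

The formula shows why a universal negative is risky: if the negative population has a different autofluorescence than the positive carrier, both differences shift and the coefficient is wrong.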

4. Use all the proper controls.

One of the most important factors in improving reproducibility is to use the correct controls in the experiment. These samples change one thing compared to the experimental sample so that it is possible to assess changes that are due to the experimental parameters and not the data acquisition process.

A lot has been written on proper controls for a flow cytometry experiment such as this article. In addition to quality control, as discussed above, proper experimental controls include compensation controls, FMO controls, unstained (autofluorescence) controls, un-stimulated controls, and positive controls.

It goes without saying that each control should be used for what it is designed to control for and should not be overinterpreted. These controls work in concert to set the correct gates to identify the populations of interest.

Figure 5: Using FMO and Unstimulated controls in concert to properly set gates. Data from Maecker and Trotter (2006). Dashed lines are added for illustration purposes.

In Figure 5, the authors show three different controls that could be used to set gates on the stimulated sample at the top. The isotype control, a historical control that has marginal value, has been discussed in detail in this article. The blue line represents this control and, as shown in the top plot, a gate set with it would significantly reduce the number of positive cells.

The FMO control (red line) is used to address issues of spectral spillover into the channel of interest. Looking at the lower left plot, there appears to be some background binding of the target antibody on the unstimulated cells, so the final gate needs to take this into account.
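One simple way to express "take both controls into account" in code is to compute a candidate threshold from each control and use the more conservative one. The sketch below is an assumption, not the method used in the figure: it takes a high percentile of the FMO and unstimulated controls in the channel of interest and returns the larger value. The 99th percentile and the function names are hypothetical choices, and any gate set this way should still be checked by eye.

```python
import math


def percentile(values: list[float], pct: float) -> float:
    """Nearest-rank percentile of a list of intensities."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def gate_threshold(
    fmo: list[float],
    unstimulated: list[float],
    pct: float = 99.0,
) -> float:
    """Positive-gate threshold that respects both controls.

    The FMO accounts for spillover into the channel; the unstimulated
    control accounts for background binding of the target antibody.
    """
    return max(percentile(fmo, pct), percentile(unstimulated, pct))


# Hypothetical intensities in the channel of interest.
fmo_events = [float(x) for x in range(1, 101)]
unstim_events = [float(x) for x in range(11, 111)]
print("gate at:", gate_threshold(fmo_events, unstim_events))
```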

When developing a panel, test all the controls you can think of during the optimization phase. The goal is to identify which controls are critical for identifying the populations of interest; those that do not help in that process can be excluded.

5. Extract the correct data.

In hypothesis-driven science, the goal is to extract the correct information from a set of experiments and use that in an appropriate statistical test to confirm or refute the hypothesis. Even before performing experiments, this information needs to be considered and determined so that an analysis plan can be established. Doing this before performing experiments will prevent HARKing (hypothesizing after the results are known) and p-hacking (where multiple statistical tests are performed to identify one that demonstrates significance in the data).

Performing the correct statistical test is essential to reproducible data, as was discussed here. Make sure that you’re taking the time at the start of the experiment to develop this plan. Knowing which data need to be extracted from the experiments will help guide the experimental design and help identify the controls that must be run. Thus, it’s critical to ensure that this plan is developed well in advance of putting cells on the cytometer.
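One lightweight way to commit to a plan is to write it down in a form that cannot be quietly edited later. The sketch below is a hypothetical illustration: a frozen Python dataclass records the hypothesis, the readout, and the test chosen before acquisition. The Bonferroni correction shown is one common, conservative way to adjust the significance threshold for several planned comparisons; it is included as an example, not as a recommendation from this article, and all field values are made up.

```python
from dataclasses import dataclass


@dataclass(frozen=True)  # frozen: fields cannot be changed after creation
class AnalysisPlan:
    hypothesis: str
    readout: str
    statistical_test: str
    alpha: float = 0.05
    planned_comparisons: int = 1

    def per_comparison_alpha(self) -> float:
        """Bonferroni-adjusted threshold for each planned comparison."""
        return self.alpha / self.planned_comparisons


# Hypothetical plan, written before any cells go on the cytometer.
plan = AnalysisPlan(
    hypothesis="Stimulation increases the frequency of cytokine-positive T cells",
    readout="percent cytokine-positive of CD3+ cells",
    statistical_test="paired t-test",
    planned_comparisons=5,
)
print(f"Use {plan.statistical_test} with alpha = {plan.per_comparison_alpha():.3f}")
```

Saving the plan alongside the raw data, with a date, makes it easy to show later that the test was chosen before the results were known.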

To get the best flow cytometry data, you need to be thinking about all the steps in your experiment to ensure that you have high-quality data to analyze. To improve the quality of your analysis and to properly track your experiments, make sure you’re adding keywords at the beginning of your experimental setup. Don’t rely on just the daily QC that is performed on the instrument – develop a quality control program that is appropriate for your experiments to add confidence in your results. When using automated compensation programs, leave the wizards to fantasy and verify that the algorithms have performed correctly. Identify the proper controls, and don’t misuse or overinterpret them when analyzing the data. Finally, know what the end goal of the experiments is and make sure to extract the appropriate data.

These steps will ultimately help improve the reproducibility of your experiments and confidence in the conclusions.


ABOUT TIM BUSHNELL, PHD

Tim Bushnell holds a PhD in Biology from the Rensselaer Polytechnic Institute. He is a co-founder of—and didactic mind behind—ExCyte, the world’s leading flow cytometry training company, which organization boasts a veritable library of in-the-lab resources on sequencing, microscopy, and related topics in the life sciences.
