If you've ever stared at a data flow diagram and felt completely lost about what each shape and arrow actually means, you're not alone. Misreading even one notation can lead to misunderstandings about how a system works, which causes poor design decisions, wasted time, and avoidable errors down the line. Learning how to read DFD notations correctly is a foundational skill for anyone involved in systems analysis, software development, or business process mapping.

What Are Data Flow Diagram Notations and Why Do They Matter?

Data flow diagrams use a small set of standardized symbols to represent how data moves through a system. These symbols show where data comes from, where it goes, how it gets processed, and where it's stored. If you misinterpret a single symbol, you might think a system stores data it doesn't, or that a process happens in a sequence it actually doesn't.

There are two common notation standards: Yourdon DeMarco and Gane-Sarson. Both use four core components, but the shapes differ slightly. Knowing which notation you're looking at is the first step to reading the diagram correctly.

If you need a refresher on the individual pieces, our breakdown of DFD diagram components and their structure covers each part in detail.

What Do the Four Core Symbols Actually Represent?

Every data flow diagram uses four types of symbols. Here's what each one means:

External Entities (Source or Destination)

These are represented as rectangles (sometimes squared rectangles in Gane-Sarson). External entities are people, organizations, or other systems that send data into or receive data from the system you're diagramming. They sit outside the boundary of the system. When you see a rectangle, think: "this is where data enters or leaves."

Processes

In Yourdon DeMarco notation, processes are circles (also called "bubbles"). In Gane-Sarson notation, they're rounded rectangles. A process takes input data and transforms it into output data. Each process should have at least one data flow coming in and at least one going out. The label inside describes the action, such as "Validate Order" or "Calculate Tax."

Data Stores

Data stores are shown as open-ended rectangles (like a rectangle missing one side). They represent places where data is held databases, files, spreadsheets, or any form of data storage. Each data store has a name and usually a number. If a data flow arrow points into a store, data is being written. If an arrow points out, data is being read.

Data Flows

These are arrows or lines that show the movement of data between entities, processes, and stores. The arrowhead indicates the direction of data movement. A label on the flow describes what data is moving, like "Customer Name" or "Payment Details." Our guide on data flow diagram symbols and their meanings dives deeper into what each variation looks like.

How Do You Tell Which Notation Standard You're Looking At?

Check the process shapes first. Circles mean Yourdon DeMarco. Rounded rectangles mean Gane-Sarson. External entities are rectangles in both, but Gane-Sarson tends to use a squared-off style. Data stores look similar in both systems. If the diagram doesn't include a legend, the process shape is your quickest clue.

How Do You Read a DFD Step by Step?

Follow this approach whenever you encounter a data flow diagram you need to understand:

  1. Start with the external entities. These tell you who or what interacts with the system. Look at every rectangle on the edges of the diagram.
  2. Trace the data flows from each entity. Follow the arrows. What data is coming in? Where does it go first?
  3. Read the process labels. Each bubble or rounded rectangle describes an action. Understand what transformation happens at each step.
  4. Check the data stores. Look at what's being stored and when data is written to or read from each store.
  5. Follow the arrows out. See where processed data ends up. Does it go to another process, back to an entity, or into a store?

This sequence helps you trace the complete data path through the system without getting overwhelmed by the whole diagram at once.

What Are Common Mistakes When Reading DFD Notations?

  • Confusing data flows with control flows. DFDs only show data movement, not the sequence or timing of events. An arrow means "this data moves here," not "this happens next."
  • Ignoring data store direction. An arrow going into a data store is not the same as one coming out. A write is different from a read. Misreading this leads to wrong assumptions about how the system handles data.
  • Mixing up notation standards. If you assume circles mean processes in a Gane-Sarson diagram, you'll be confused. Always identify the notation first.
  • Skipping the labels. Every data flow, process, and store should have a label. If a label is missing, it's a sign the diagram is incomplete not something to guess about.
  • Reading level-0 diagrams the same as level-1 diagrams. A context diagram (level 0) shows the entire system as one process. Lower levels show detail. Don't expect the same level of information from every layer.

When Would You Need to Read a Data Flow Diagram?

You'll encounter DFDs in several real situations:

  • During requirements gathering analysts create DFDs to show stakeholders how the proposed system will handle data.
  • In legacy system documentation older systems often have DFDs instead of modern UML diagrams.
  • When auditing or troubleshooting DFDs help identify where data bottlenecks or errors might occur.
  • In academic or certification settings many computer science and business analysis courses require DFD literacy.
  • For regulatory compliance industries like healthcare and finance use DFDs to document data handling for audits.

Are There Any Quick Tips to Remember?

  • External entities never connect directly to each other. Data must pass through at least one process.
  • A process with no output is called a "black hole" and signals an error in the diagram.
  • A process with no input is called a "miracle" and is also an error.
  • Every data flow needs a meaningful label. "Data" or "Info" is too vague to be useful.
  • Data stores should not connect directly to external entities without going through a process.

For a deeper look at how these pieces fit together structurally, our article on DFD diagram components and structure walks through how to build and verify a well-formed diagram.

What Should You Do After You Can Read a DFD?

Once you're comfortable reading notations, try creating your own diagram from a simple scenario like an online order process or a library checkout system. Start with a context diagram, then break it down into a level-1 diagram. Label every flow, process, and store. Check that there are no black holes, miracles, or unlabeled arrows. This hands-on practice is the fastest way to lock in what you've learned.

Quick checklist for reading any DFD:

  • Identify the notation standard (Yourdon DeMarco or Gane-Sarson) by the process shape.
  • List all external entities these are your system boundaries.
  • Trace every data flow arrow and read its label.
  • Confirm each process has at least one input and one output.
  • Verify the direction of arrows in and out of data stores.
  • Check for missing labels, black holes, or miracle processes.
  • Understand which diagram level you're reading (context vs. detailed).

For further reading on standard DFD notation conventions, the Wikipedia article on data flow diagrams provides a solid technical reference.