TL;DR
- DEXA and Bod Pod are the two most defensible field-accessible methods for athletes. DEXA has ±1–2% error on body-fat percentage; Bod Pod ±2–3%.[1][2]
- Skinfold calipers, properly done, are within ±3–4% and remain the best cost-effective longitudinal tool.[5]
- Home BIA scales are the noisiest option, with daily variation of 3–5% just from hydration.[6]
- Trend, not absolute value. No method in the list gives you a number you should treat as ground-truth in isolation.
Body composition is harder to measure than weight. There's no metabolic-cart analogue for body fat — every available method is either indirect (DEXA, Bod Pod) or estimated from a correlate (skinfolds, BIA, tape measures). This article compares the main options by accuracy, reproducibility, and operational cost so you can choose the one that actually answers your question.
DEXA (Dual-Energy X-Ray Absorptiometry)
A medical-grade low-dose X-ray scan that distinguishes bone, fat, and lean tissue by attenuation at two energies. Output includes total body fat %, regional fat distribution, and bone mineral density.
Accuracy: ±1–2% body-fat error against hydrostatic weighing in validation studies[2][3]. Probably the best field-accessible body-composition tool currently available. Radiation dose is trivially low (≈0.001 mSv, well below a chest X-ray).
Limits:
- Hydration sensitivity. A post-workout scan can read 1–2% lower body fat than a rested morning scan on the same person.
- Machine-to-machine variation. GE Lunar and Hologic scanners produce systematically different numbers. Use the same scanner for tracking.
- Cost. €50–€150 per scan in most European markets. Not practical to repeat monthly.
Best for: baseline and periodic (every 3–6 months) measurement, with the same scanner, same time of day, similar hydration.
Bod Pod (air-displacement plethysmography)
You sit inside a sealed chamber; the machine measures body volume by air displacement, then uses body density to infer fat mass via Siri's equation. Pneumatic, not ionising — no radiation.
Accuracy: ±2–3% body-fat error[2]. Slightly noisier than DEXA but comparable for practical purposes.
Limits:
- Cheaper than DEXA per scan (€30–€80) but availability is spottier; not every clinic has one.
- Sensitive to lung volume estimation — the test asks you to breathe quietly and can be biased by irregular breathing.
- Does not give you regional fat distribution or bone mineral density.
Best for: athletes who want a defensible body-fat estimate without radiation and can access a facility locally.
Skinfold calipers
Pinch subcutaneous fat at 3, 4, or 7 standardised sites; apply the Jackson-Pollock[5] or Durnin-Womersley regression to estimate body density, then Siri for body-fat percentage.
Accuracy when performed by a trained tester: ±3–4% body fat[1]. When performed by an untrained tester: the number is mostly noise.
Practical strengths:
- Excellent longitudinal tool. The same tester on the same athlete every 2–4 weeks produces a trend line you can trust, even if the absolute % is off.
- Cheap. Harpenden calipers last decades; a €80 Slim Guide is adequate.
- Resistant to hydration. Measures subcutaneous fat directly; glycogen and water shifts don't distort the measurement the way they do DEXA and BIA.
Weaknesses:
- Self-administered skinfolds are unreliable for torso sites (suprailiac, subscapular). If you can't get a training partner or coach to pinch for you, the 3-site Jackson-Pollock sites you can reach (triceps, thigh, suprailiac) are more reliable than going solo on 7 sites.
- Systematic underestimation at very low body fat (<8% men, <14% women). The Siri equation assumes a typical hydration of lean mass that contest-lean lifters violate.
Navy / tape-measure method
The Body Fat Percentage Calculator implements the U.S. Navy method[4], which estimates body fat from neck, waist, and hip circumferences plus height. Accuracy against hydrostatic weighing: ±3–4% body fat in the validation cohort, noisier at the population extremes.
Practical strengths:
- No equipment beyond a measuring tape.
- Reproducible if you measure consistently (waist at navel, same time of day, fasted).
- Resistant to the hydration noise that plagues BIA.
Weaknesses:
- Systematically overestimates body fat in lean muscular lifters (a narrow waist relative to height can still produce inflated estimates if shoulders and chest are large).
- The formula assumes a typical fat distribution. Visceral-heavy fat distribution reads higher; evenly distributed fat reads more accurately.
BIA (bioelectrical impedance)
Passes a small AC current through the body; fat-free mass conducts better than fat mass. Output: body-fat %, sometimes broken down by segment.
Accuracy: ±5–8% body fat on consumer scales[6]. Reasonable hand-to-foot medical-grade units can hit ±3–4%.
Key weakness: hydration is the primary input. A morning, dehydrated reading can be 3–5% higher than a post-dinner reading on the same day. For longitudinal tracking, BIA requires strict protocol (same time, same hydration status, same day of menstrual cycle for women) or the day-to-day noise swamps the weekly signal.
The best use of BIA is directional ("am I trending down?") rather than absolute ("I am 14.3% body fat").
Hydrostatic weighing and 4-compartment models
Hydrostatic (underwater) weighing is the classical gold standard for body-composition measurement, treating the body as two compartments (fat and fat-free mass) and using Archimedes' principle to determine body density. Accuracy: ±1.5% body fat in validated cohorts. Practically unavailable for most athletes outside research labs.
Four-compartment models (fat, water, bone mineral, protein) are even more accurate (±1%) but require DEXA, hydrostatic or Bod Pod, and bioimpedance data all combined. Reserved for academic research and elite-athlete contexts with direct access to these facilities.
Longitudinal tracking protocol
Regardless of method, the longitudinal protocol that produces defensible trend data looks the same:
- Same method each time. Don't switch between DEXA and skinfolds mid-block; they measure differently.
- Same conditions. Time of day, hydration status, fasting vs fed, day of cycle (women). Variables you don't control become noise.
- Same operator where applicable. Skinfolds with different testers are effectively different methods. DEXA on different machines is effectively different methods.
- Enough frequency to see the trend, not so much that you see noise. See the frequency table below.
Method Optimal frequency Why that frequency
────────────────────────────────────────────────────────────────
DEXA Every 3–6 months Cost + radiation band
Bod Pod Every 2–3 months Availability
Skinfolds (trained) Every 2–4 weeks Tracks body-composition changes
Navy tape Every 2 weeks Low-noise, repeatable
BIA (protocol-adherent) Weekly Noisier but cheap
Photos Monthly Biggest signal, lowest effort Body fat percentage vs body fat trajectory
Two lifters can both be “14% body fat” while one is heading up and one heading down. The trajectory matters more than the absolute reading. Protocol:
- Pick a method.
- Measure at the start of a block.
- Measure again every 4 weeks across the block.
- Plot the three or four readings.
- The slope is signal; the individual readings are noisy points along it.
A 4% apparent change over 12 weeks is almost certainly real (exceeds the error band of most methods). A 1–2% apparent change over 4 weeks could easily be noise.
Method comparison
Method Accuracy (BF%) Cost per read Longitudinal usefulness
───────────────────────────────────────────────────────────────────────────────────────
DEXA ±1–2% €50–150 Excellent (quarterly)
Bod Pod ±2–3% €30–80 Very good
Skinfold (trained) ±3–4% one-time €80 Excellent (bi-weekly)
Skinfold (self) ±5–7% one-time €80 OK
Navy tape ±3–4% free Good
BIA (medical-grade) ±3–4% €30–60 Good with protocol
BIA (consumer scale) ±5–8% €50 one-time Directional only
Visual estimation from photo ±3–5% free Surprisingly good Visual estimation isn't silly
Against DEXA, trained observers visually estimating body fat from four standardised photos perform roughly on par with consumer-grade BIA scales. A good body-fat reference chart is a defensible sanity check on any other method. If DEXA says 8% and you don't have striations, someone's wrong — probably the DEXA if you scanned dehydrated post-lift.
Recommendation by use case
Recreational lifter tracking long-term changes: Navy tape monthly + photos. Cheap, reproducible, directional. Upgrade to DEXA annually if you want a calibration point.
Competitive bodybuilder / physique athlete: Trained-tester skinfolds biweekly during prep + DEXA at the start and end of prep. Calipers handle the weekly trend; DEXA bounds the absolute number.
Powerlifter / strength athlete: Don't over-index on body-fat percentage. Track bodyweight and performance; let FFMI and waist-to-height give you the composition proxy. FFMI is more informative for strength-sport goals than absolute body fat.
General fitness tracking: Whatever method you'll actually do consistently. The best body-composition method is the one you use on schedule for two years.
Summary
- DEXA is the accuracy floor for field-accessible methods; skinfolds are the cost-effectiveness floor.
- BIA is fine for direction, unreliable for absolute values.
- Longitudinal trend matters more than any single reading.
- For strength athletes, FFMI and bodyweight trends often tell you more than body-fat % does.
Tools: Body Fat Percentage Calculator (Navy), Lean Body Mass Calculator, FFMI Calculator.
Population boundaries of the validation data
Every body-composition method's accuracy figure is derived from a specific validation sample. Honest interpretation requires knowing who the sample was:
- DEXA validation studies. Most multi-method comparisons[2] recruited healthy adults ages 20–50 with BMI 18–35. Accuracy at the extremes — very lean athletes (BF<8%), extreme muscular lifters, and obese adults (BMI>35) — is meaningfully worse. For contest-lean bodybuilders, Kasper et al. 2021[1] documented DEXA reading body-fat percentages that fell outside the validated ±2% band — actual error up to 3–4% for hydration-extreme states.
- Navy tape method validation. The 1984 U.S. Naval Health Research Center study[4] fit the equation on 274 Navy personnel (men and women, active-duty, BMI 18–32). The cohort was young adult, moderately trained, with typical military body composition. For competitive powerlifters with outsized necks and narrow waists, the formula over-reads consistently; for contest-lean bodybuilders, it under-reads. Population-specific calibration shifts of 2–4% are routine.
- Jackson-Pollock skinfold regressions. Fit on roughly 300 men (3-site and 7-site variants)[5] ages 18–61. Black adults were under-represented; Asian adults were not in the fitting dataset. Published follow-up work shows the equation can under- or over-read by 2–3% in populations outside the fitting sample.
- BIA consumer-scale validation. Kyle et al. 2002[6] validated leg-to-leg BIA on 223 healthy adults. The equation assumes typical hydration; athletic and contest-prep populations routinely fall outside the assumed hydration range, producing systematic error that consumer BIA units don't correct for.
- No validation covers children, adolescents, pregnancy, or clinical populations. For anyone in these groups, the accuracy bands quoted in this article don't apply. Use clinical-grade DEXA with paediatric or population-appropriate references, not the consumer methods above.
Alternative-view framing: direct vs indirect measurement
Every method in this article is indirect. The gold-standard alternatives are limited but worth naming:
- Cadaver dissection. The original ground truth for body composition. Impractical.
- MRI whole-body segmentation. Directly segments adipose tissue from muscle and organ tissue. Accuracy within ±1% body fat. Clinical research only; cost and availability prevent routine use.
- Neutron activation analysis. Measures total body nitrogen (proxy for protein mass) and other elements via nuclear bombardment. Gold standard for body-component estimation. Research-only.
- 4-compartment models (4C). Combines DEXA, hydrostatic weighing or Bod Pod, and bioimpedance into a four-part decomposition (fat, water, bone mineral, protein). Most accurate field-accessible reference (±1%). Available in elite-athlete training centres and some university sports-science labs.
For an athlete asking "what should I use?" — the answer remains DEXA or skinfolds (trained tester) as the defensible practical ceilings, with the understanding that a single reading from either carries a ±2–4% confidence band that no amount of tooling will eliminate without moving to 4C or MRI.
Worked example: 12-week physique-prep tracking
A natural bodybuilder, 79 kg, 12% body fat (pre-prep DEXA baseline), starts a 12-week prep. Track with DEXA at weeks 0 and 12, trained-tester Jackson-Pollock 7-site skinfolds every 2 weeks, Navy tape weekly, and BIA daily (protocol-controlled: morning fasted). Photos weekly.
Week DEXA JP7 skinfolds Navy tape BIA (day 1) Photo rank
─────────────────────────────────────────────────────────────────────
0 12.0% 12.5% 13.2% 13.8% baseline
2 — 11.8% 12.4% 13.1% slight drop
4 — 10.9% 11.8% 12.7% visible
6 — 10.0% 11.0% 12.1% striations
8 — 9.1% 10.3% 11.6% strong
10 — 8.2% 9.6% 11.0% lean
12 7.5% 7.4% 8.8% 10.2% contest
DEXA delta (week 0 → 12): −4.5% body fat (4.5%/12 weeks)
JP7 delta (weeks 0 → 12): −5.1% body fat
Navy delta (weeks 0 → 12): −4.4% body fat
BIA delta: −3.6% body fat (systematically under)
All four methods agree on direction and rough magnitude.
Absolute readings diverge by 1–2 points at any single time-point,
converging at the prep endpoint where all methods read lean. The lesson: no single method is the "truth." The agreement across methods on the trajectory (−4.5 to −5 points over 12 weeks) is the robust signal. The daily BIA readings give the athlete a noisy but frequent adherence check; the biweekly skinfolds give the mesocycle-level trend; the bookend DEXA bounds the absolute. Photos remain the single most useful visual confirmation that the numbers are tracking actual tissue change, not model artefacts.
Common failure modes
- Switching methods mid-block. Scanning with DEXA in week 0 and skinfolds in week 6 doesn't produce comparable data — you can't diagnose whether a "change" is real or a methodology artefact. Pick one primary method and stick with it.
- DEXA scans with inconsistent hydration. A post-workout scan reads 1–2% lower body fat than a rested morning scan on the same person, same day[3]. If you're using DEXA for tracking, control hydration ruthlessly — fasted, well-rested, no training the day before.
- Self-administered 7-site skinfolds. The torso sites (suprailiac, subscapular, chest) can't be reliably pinched alone. Self-testing on these sites routinely produces ±5–7% body-fat noise[1]. If you can't get a training partner, use the 3-site accessible variant, not the 7-site.
- BIA readings used to make weekly programming decisions. Daily BIA noise of 3–5% from hydration alone[6] means a week-over-week 1% "change" is almost certainly noise. Use BIA for monthly trend, not weekly adjustment.
- Trusting a single DEXA at contest-lean. At BF<8%, the fixed hydration assumption in Siri-based models is violated — the scan can read 1–2% too low or high depending on glycogen state. Take two DEXAs 3–5 days apart in very lean conditions and average.
- Navy tape on an outlier frame. A lifter with 48 cm shoulders, 45 cm arms, and a 76 cm waist at 180 cm height reads high on Navy formula because the waist-to-neck ratio doesn't capture the lean-mass distribution. A second-method cross-check (skinfolds or DEXA) recalibrates the bias.
- Forgetting that "lean mass" includes water and glycogen. A 2 kg water/glycogen shift from carb-loading before a DEXA inflates measured LBM by 2 kg and drops measured body-fat percentage by roughly 1 point without any real tissue change. Control carbohydrate intake the day before any DEXA scan intended for comparison.
References
- 1 Accuracy of body fat assessment methods in competitive bodybuilders (DEXA, BIA, skinfolds, Bod Pod) — Journal of the International Society of Sports Nutrition (2021)
- 2 Comparison of dual-energy X-ray absorptiometry, air-displacement plethysmography, and hydrostatic weighing — Medicine and Science in Sports and Exercise (2002)
- 3 Reliability of DEXA measurements of lean mass in a clinical setting — Clinical Physiology and Functional Imaging (2015)
- 4 A circumference-based estimate of body fat (Navy method) — U.S. Naval Health Research Center (1984)
- 5 Generalized body composition prediction equation for men using simple measurement techniques (Jackson-Pollock 7-site) — Medicine & Science in Sports & Exercise (1978)
- 6 Validity of leg-to-leg bioelectrical impedance analysis in assessing body composition — European Journal of Clinical Nutrition (2002)