Key Takeaways

  • Accurate CPT coding separates evaluation work from administration and prevents underbilling

  • Audit-proof documentation starts with medical necessity, time tracking, and integrated reporting

  • Ethical, sustainable testing services balance fair compensation with client access and transparency

Stop losing money on testing because billing feels confusing

You finish a full test battery, write a careful report, and send the claim feeling done. Then the EOB comes back with a denial or a reduced payment because one code was missing, time was not supported, or the payer disagreed with how services were split.

When you track it correctly, a single comprehensive evaluation often includes 6–12+ hours of billable professional time across testing, scoring, interpretation, and report work. If even 1–2 hours of that time is not coded or documented in a way the payer accepts, the financial hit adds up quickly over a month of cases.

By the end of this section, you will be able to:

  • Choose the right CPT codes for the work you actually did

  • Document time in a way that matches how payers review testing claims

  • Spot common coding gaps that trigger denials or underpayment

Choose the right testing CPT codes without second-guessing

Next, pick codes that match what you actually did in the session, not what you planned to do. A common reason testing claims get denied or underpaid is when the billed units mix up evaluation work (clinical judgment) with administration (giving tests) and scoring (turning responses into numbers or results).

If you do one thing, do this: write down the work in the same order you did it, then map each chunk to the right code category. For example, a 2-hour visit might include 30 minutes of clinical interview and record review, 60 minutes of test administration by a technician, and 30 minutes of scoring and test selection decisions by the psychologist. Those are not interchangeable just because they happened on the same day.

Also, decide whether you are doing psychological or neuropsychological testing based on the primary referral question and the main focus of your interpretation. Psychological testing fits when the question is mainly about mood, anxiety, personality, behavior, or diagnostic clarification. Neuropsychological testing fits when the question is mainly about brain based functioning such as memory, attention, processing speed, language, or executive skills.

Here’s the catch: the measure mix alone does not choose the family for you. A battery can include cognitive tests and still be psychological testing if the referral is primarily diagnostic or emotional functioning, and it can include rating scales and still be neuropsychological testing if the core question is cognitive change or neurologic impact.

In practice, use this quick decision checklist before you submit:

  • Referral question: What is the one sentence question you are answering

  • Predominant focus: Are you mainly interpreting emotional and behavioral patterns or cognitive and neurologic patterns

  • Who did what: Which tasks were done by the psychologist versus a technician or trainee

  • Time capture: Do your timestamps separate interviewing and integration from administration and scoring

  • Common mismatch to fix: Do not bill evaluation time for straight administration just because you were present

If you’re short on time, start with just two items: the referral question sentence and a simple time log by task. Those two pieces often prevent the most expensive error, billing a code family that does not fit the real purpose of the evaluation.

Build documentation that survives audits and supports appeals

Next, shift your focus from “did I do good work” to “can a reviewer see the logic fast.” Audits and denials often come down to missing basics: why testing is needed now, what you did minute by minute, and whether billable time is clearly separated from non-billable time.

If you do one thing, make medical necessity impossible to misread. In 4 to 6 sentences, answer two questions in plain language: why testing is needed now and why simpler methods (like a brief screener, interview-only evaluation, rating scales, or records review) will not give a reliable answer for this referral question. Tie the need to a decision point such as school accommodations, return-to-work clearance, treatment planning, or differential diagnosis when symptoms overlap.

Common mistake: writing a vague line like “rule out ADHD” with no time pressure or functional stakes. Fix it by adding the prompt and the consequence, for example: “Client reports 9 months of attention problems with recent job performance warnings; testing is needed now to distinguish ADHD from anxiety and sleep disruption so the treatment plan and workplace recommendations are appropriate.”

Also, make your session notes audit-ready by separating (1) billable testing time, (2) non-billable breaks, and (3) administrative time that might be billed under different codes. Reviewers look for internal consistency, so your note should allow someone to add up minutes without guessing.

Include these elements every testing day:

  • Start and stop times for each testing block

  • Tests administered (name or test family) and the reason for selecting them

  • Behavioral observations tied to validity or interpretation (effort, persistence, frustration tolerance, need for redirection)

  • Accommodations or modifications (extra breaks, interpreter use, sensory supports)

  • Clear separation of non-billable breaks (snacks, phone calls, bathroom, waiting time)

Here’s the catch: “we took a break” is not enough if the total time is being billed. Label breaks as non-billable and note the time range, for example “10:42–10:52 non-billable break.” If you are short on time, at minimum capture start/stop times, tests, key observations, and break timing, then add fuller rationale the same day while details are fresh.

Make testing financially sustainable while staying ethically aligned

That said, sustainable testing usually comes down to pricing choices you can defend on paper and explain out loud.

Start by setting your fees intentionally across insurance and self-pay, so you are not quietly subsidizing complex evaluations with chronic underbilling. If you spend 8 to 12 hours total on a neuropsych eval (intake, testing, scoring, record review, report, feedback) but your typical insurance payment only covers a fraction of that time, the gap will show up as rushed reports, fewer test batteries, or constant after-hours work.

If you do one thing, calculate your real cost per evaluation before you change anything: estimate your average hours per case (for example, 10 hours), multiply by your target hourly rate, then compare it to the combined expected reimbursement (testing codes plus report/interpretation where applicable) or your current self-pay fee. Common mistake: keeping a “standard” self-pay rate that is lower than your effective insurance rate for similar work; fix it by mapping fees to time and complexity, then updating your fee schedule at set intervals (for example, every 6 to 12 months)

To avoid hidden subsidies, keep your financial policies consistent across referral sources and payer types, and make write-offs a conscious access decision rather than a default. Practical checks that keep you honest:

  • Track average total clinician hours per evaluation over your last 10 cases

  • Track average allowed amount and average patient responsibility for your top 3 payers

  • Flag any case type where the expected total payment is under your minimum viable amount

  • Decide in advance which cases you will accept at a reduced rate and how many per month

Next, protect access with sliding-scale or tiered options, plus transparent cost estimates before testing begins. A simple way to do this without confusion is to offer 2 to 3 tiers that match real differences in service level or time, then give clients a written estimate that separates what you control (your fee) from what you do not (possible insurance adjustments).

If you’re short on time, skip building a complex sliding-scale formula and start with two tiers plus a clear estimate template. For example:

  • Tier 1: Brief diagnostic clarification with a shorter test battery and a shorter report

  • Tier 2: Standard comprehensive evaluation (your most common package)

  • Tier 3: Complex differential or high-record-volume cases with added time for collateral and record review

Here’s the catch: transparency is part of ethics, not an admin task. Before testing starts, provide a plain-language estimate that includes expected range, what could change it (extra records, additional test sessions, missed appointments), and when payment is due, so clients can consent with full cost clarity.

Sustainability is an access issue when testing is hard to bill

When testing time is missed, coded inconsistently, or documented with gaps, reimbursement gets shaky and services shrink. In practice, that means fewer openings for referral partners, longer waits for families, and more pressure to shorten batteries or skip follow-up time that actually helps clients use the results.

If you do one thing, make every testing hour visible in your workflow. Track time the same day, tie it to the right code, and keep documentation tight enough that someone else could follow your logic weeks later without guessing.

What changes when every testing hour is tracked, coded, and documented with confidence

So ask yourself what would shift in your practice if each 15 minutes of testing services had a clear start point, a clear end point, and a clear record of what happened. Most clinicians see the same ripple effects: fewer claim corrections, faster follow-up after denials, and more predictable monthly cash flow even when caseloads fluctuate.

That consistency also protects clinical judgment. When your notes and codes match what you actually did, you can spend less time reconstructing billing decisions and more time doing the work clients came for.