PROOF Series #1 — Compliance Coding Rubric

Document version: 1.0 — Drafted May 14, 2026 Author: Shawn Lai, WTT Digital Purpose: Operationalize California Rules of Professional Conduct 7.1 and 7.4 into a scored, reproducible categorical scheme for law-firm web content.

What this rubric is for

The PROOF Series #1 study asks whether material California Rule 7.1 compliance correlates with AI-search citation rates. To answer that question rigorously, "material compliance" has to be operationalized as something more precise than a subjective read. Two independent coders applying this rubric to the same site should arrive at substantially the same score.

This document is the rubric. It is published as a standalone artifact so that the methodology can be audited, replicated, or challenged before data collection begins. Pre-registration of the analysis plan that uses this rubric is published separately.

Scope of the rubric

The rubric applies to public-facing law firm web content in the following surfaces, in this order of priority:

The homepage of the firm's primary domain
The "About" / "Our Firm" page
Each named practice-area page
Each named attorney bio page
The meta description and Open Graph tags of each of the above

It does NOT cover:

Off-site content (Avvo profiles, Justia listings, JD Supra bylines) — these have their own platform-specific rules and are out of scope for this study's compliance scoring
Paid advertising content (Google Ads, Meta Ads) — separate Cal Bar rules apply
Email content — typically not public-facing
Social media content — separate study would be required
PDF or downloadable assets — too variable across firms to code consistently

The rubric is built for California firms. Firms licensed in multiple states are scored against California rules only; multi-state extensions are noted but not scored.

The five flag categories

Each surface listed above is scored on five flag categories. Each flag is binary (present / absent) but weighted differently based on Cal Bar enforcement priority and AI-citation impact (per the working hypothesis).

Flag 1: Specialization / specialist claims (weight: 3)

Trigger phrases (case-insensitive, partial match counts): - "specialist" / "specialists" - "specializing in" / "specialize in" / "specializes in" - "specialization" - "specialty"

Rule basis: California Rule of Professional Conduct 7.4(c) reserves the words "specialist," "specialize," and related forms for attorneys formally certified by the State Bar's Legal Specialization program.

Permitted exceptions (no flag if): - The phrase appears with explicit reference to a State Bar Certified Legal Specialist designation (e.g., "Certified Specialist in Family Law, California State Bar Board of Legal Specialization") AND the attorney holds that certification - The phrase appears in a direct quotation from a third party (e.g., a court ruling, a news article) and is clearly attributed - The phrase appears in the firm's name itself (rare but possible for grandfathered firm names)

Scoring: 1 point per surface where the trigger appears outside the exception, up to a max of 5 points per firm (homepage + about + three practice-area pages). Weighted 3x. Maximum contribution: 15 points.

Flag 2: Superlative quality claims (weight: 2)

Trigger phrases: - "best" (when describing the firm, its attorneys, or its services) - "top" / "top-rated" / "top-tier" - "leading" / "premier" / "preeminent" - "most experienced" / "most respected" / "most trusted" - "award-winning" (without proximate award identification) - "exceptional" / "outstanding" / "unparalleled" - "elite" - "world-class"

Rule basis: California Rule 7.1 Comment [3] flags language likely to create "unjustified expectation." Florida Rule 4-7.13 (used as a stricter comparative since California firms with multi-state practices often face Florida exposure) explicitly prohibits superlatives without verifiable third-party rating disclosed inline.

Permitted exceptions (no flag if): - The superlative is supported by a named third-party rating in the same paragraph or within 100 words (e.g., "named a Super Lawyers Top 100 attorney in California, 2024-2025") - The superlative appears in a direct quoted client review (testimonials have their own disclaimer requirements but the superlative itself is not the firm's claim) - The superlative refers to a measurable, verifiable attribute (e.g., "the most settlements above $1M in San Francisco County in 2024" with the underlying data citable)

Scoring: 1 point per superlative instance, up to a max of 10 points per firm. Weighted 2x. Maximum contribution: 20 points.

Flag 3: Predictive outcome language (weight: 3)

Trigger phrases: - "guarantee" / "guaranteed" / "we guarantee" - "we will get you" / "we'll get you" - "we will recover" / "we'll recover" - "successful outcome" / "successful result" (predictive) - "we ensure" / "we fight to ensure" - "winning result" (predictive) - "you will" (in any predictive context — "you will receive," "you will recover") - "count on a [positive outcome]"

Rule basis: Cal Rule 7.1 Comment [3] directly addresses language likely to create unjustified expectation about results. This is the highest-risk category because it most cleanly fits the rule's stated trigger.

Permitted exceptions (no flag if): - The phrase appears in a past-tense factual statement (e.g., "We recovered $2.4M for our client" — this is a past-results claim, scored under Flag 4 instead) - The phrase is part of a generic ethical commitment ("we guarantee our commitment to client communication" — describes process, not outcome) - The phrase appears in a quote attributed to a client testimonial (separate scoring path)

Scoring: 1 point per instance, up to a max of 8 points per firm. Weighted 3x. Maximum contribution: 24 points.

Flag 4: Past results without proximate disclaimer (weight: 2)

Trigger: - Specific monetary recoveries or settlements stated by amount (e.g., "$2.8M jury verdict," "recovered $1.2M for our client") - Specific case outcome descriptions (e.g., "won dismissal," "prevailed at trial," "successfully defended") - Phrases summarizing patterns of outcomes (e.g., "successfully recovered substantial amounts," "track record of favorable verdicts") - Named precedential cases the firm worked on (when used to imply future capability)

Rule basis: Cal Rule 7.1 Comment [3] plus the standard Cal Bar past-results disclaimer requirement.

Required disclaimer language (any of the following counts as adequate): - "Past results do not guarantee a similar outcome." - "Past results do not guarantee future outcomes; each case is evaluated on its own facts." - Substantially equivalent language conveying the same meaning

Disclaimer placement requirements: - Same page as the past-results reference - Within 200 words of the reference, OR in a clearly visible footer/sidebar that applies to all past-results content - Same surface (a disclaimer in a separate "Disclaimer" link page does NOT count — the disclaimer must be reachable in the same view)

Scoring: 1 point per past-results reference without an adequate proximate disclaimer, up to 8 points per firm. Weighted 2x. Maximum contribution: 16 points.

Flag 5: Fee and free-consultation language without required qualifiers (weight: 1)

Trigger phrases: - "No fee unless we win" / "no fees unless we WIN" / variations - "Free consultation" (when paired with personal injury or contingency-fee practice) - "Contingency fee" (when used without clarifying client cost responsibility) - "No upfront fees"

Rule basis: California Rule 7.4(d) requires that contingency-fee communications disclose that the client may remain responsible for costs and expenses regardless of outcome. The phrase "no fee unless we win" without that disclosure is technically non-compliant.

Required disclaimer language: - Some form of: "Client may be responsible for costs and expenses regardless of outcome." OR - "Costs and expenses are separate from attorney's fees."

Disclaimer placement: Same page, proximate (within the same content block or a clearly linked footer on the same page).

Scoring: 1 point per instance without proximate qualifier, up to 5 points per firm. Weighted 1x. Maximum contribution: 5 points.

Aggregate scoring

Total possible flag points: 15 + 20 + 24 + 16 + 5 = 80 points maximum.

A firm with zero flagged content scores 0. A firm with significant exposure across all five categories approaches 80.

Classification cutoffs for the study

For the PROOF Series #1 analysis, firms are classified into three bins:

Bin	Score range	Label
A	0 – 8	Materially compliant
B	9 – 24	Mixed signals
C	25 – 80	Materially non-compliant

Cutoffs are chosen so that Bin A captures firms with at most isolated borderline issues (e.g., one unqualified superlative) and Bin C captures firms with patterned issues across multiple categories. Bin B is the gray zone and is reported separately but not used as the primary comparison group.

The primary study comparison is Bin A vs. Bin C for citation-rate analysis. The Bin B middle is reported descriptively but not used to test the central hypothesis, because the categorical contrast is what gives the hypothesis test its power.

Coder protocol

To ensure inter-rater reliability:

1. Two independent coders score each firm. No discussion until both have completed scoring.

2. Each coder reviews the five surfaces in the documented order (homepage → about → practice-area pages → bios → meta tags). Scoring happens surface by surface, not by flag category — the coder applies all five flag categories to each surface before moving to the next.

3. Coders use a structured spreadsheet with one row per (firm, surface, flag) tuple. The aggregate firm score is calculated, not assigned.

4. Disagreements resolved by adjudication. Where the two coders' scores differ, a third coder reviews and either ratifies one of the two scores or assigns a new score. The third-coder score is final. All disagreement instances are logged for the methodology appendix.

5. Inter-rater reliability is reported. Cohen's kappa across coder pairs is calculated and reported in the final paper. Pre-study target: kappa ≥ 0.70 (substantial agreement). If pilot kappa falls below 0.65, the rubric is revised before full-study coding begins.

Adapting the rubric for other states

This rubric is California-specific. Adaptations to other state rules would adjust:

Florida (Rule 4-7.13): Flag 2 (superlatives) weighting increases; testimonials require explicit handling; pre-filing review requirement is referenced but not scored (pre-filing is process, not content)
New York (Rule 7.1): Flag 4 (past results) disclaimer language becomes more specific ("Prior results do not guarantee a similar outcome" must appear in the same communication, verbatim)
Texas (Disciplinary Rule 7.02): Flag 3 (predictive outcomes) weighting increases; the rule's "unjustified expectation" trigger is enforced more actively than in California

For the present study, California-only scoring keeps the methodology clean. Multi-state extensions are future work.

Limitations of this rubric

The rubric does not capture:

Implicit testimonial language ("clients consistently report...") that implies a testimonial without quoting one. This is a real Cal Bar issue but coding it consistently requires more interpretive judgment than the rubric's structure allows. Future versions may incorporate.
Image and video content. A firm video saying "we guarantee results" violates the same rules but is not in scope for this rubric, which scores text only.
Hidden content. Content behind logins, paywalled assets, or interactive widgets that don't render statically is not scored.
Dynamic content variation. A site that randomizes its hero text or A/B tests headlines may present different content to different visitors. The coder uses a single snapshot per surface and notes any obvious variation; longitudinal variation is out of scope.

These limitations are acknowledged so the paper's claims stay calibrated to what was actually measured.

Version history

v1.0 (May 14, 2026): Initial published version. To be referenced by the PROOF Series #1 pre-registration document.

Future versions will be tracked here. Substantive changes after data collection begins require a methodology update and a re-coding pass.

This rubric is part of the PROOF Series #1 study materials. The full pre-registration document is at wtt.digital/proof/cal-bar-citation-study/pre-registration. Methodology questions, replication interest, or peer-review feedback can be sent to [email protected].