Skip to content
EARNST.
Tracking & Compliance

First-Party Data Strategy 2026: Why Your Customer Data Is Worth More Than Your Ad Budget

Most SMEs spend 10,000 EUR/month on advertising, but the data generated belongs to Google. Building your own data asset changes everything.

EARNST · · 22 min read

Key Takeaways

  • Your Google Ads campaigns generate thousands of data points — but the data belongs to Google, not you
  • Third-party cookies are already dead for 40-50% of users. Without your own data strategy, you lose
  • Three stages: Own Visitor Identity, Engagement and Behavioural Data, CRM Integration
  • 90-day plan: From zero to your own data strategy — without a million-dollar budget

Your Google Ads campaigns generate thousands of data points every month: who clicks, who buys, who returns. But this data does not belong to you. It belongs to Google. And Google uses it — for you and for your competitors.

The analogy is simple: you are investing in a house, but the land belongs to someone else. When the landowner changes the rules — and they do, regularly — you are left with a house on someone else's ground. First-party data is the land.

This article shows how to build your own data strategy step by step. No million-dollar budget, no data science department, no CDP with a six-figure annual licence. Just the right infrastructure and a clear 90-day plan.

What first-party data is — and what it is not

The data pyramid

Zero-party data: Directly shared by the customer. Preferences, surveys, quiz answers, wishlists. The customer provides this data voluntarily and consciously. Highest quality, but hard to scale.

First-party data: Collected by you on your platforms. Website behaviour, shop interactions, email engagement, CRM data. You control the collection, storage, and usage. This is the core of this article.

Second-party data: Shared by partners. Hotel portal shares booking data with the airline. Relevant for large companies with partnership ecosystems, less so for SMEs.

Third-party data: Purchased from data brokers. Cookies and tracking pixels that collect across websites. Dying out — technically through cookie deprecation, legally through GDPR.

What counts as first-party data

  • Visitor Identity — own cookie, 13-month lifetime, independent of GA4
  • Engagement data — scroll depth, active time, product image interactions
  • Purchase history and cart data — what was bought, what was abandoned, which variants were compared
  • CRM data — email, segments, Customer Lifetime Value
  • Consent preferences — what the customer allowed, when, in which version
  • Cross-session behavioural data — products viewed across multiple visits, comparison patterns

What is not first-party data

The GA4 Client-ID belongs to Google. Safari deletes it after 7 days. Google can change the rules at any time. Google Ads Conversion Data resides at Google. You see reports, but the raw data is not yours. Facebook Pixel Data resides at Meta. Same principle. All three are rentals, not ownership.

Why third-party is dying — and what it means for your marketing

The timeline

In 2020, Safari ITP blocked third-party cookies completely. In 2023, Firefox ETP followed. In 2024, Chrome announced deprecation — then reversed, then announced again. In 2025, Google launched the Privacy Sandbox as an alternative, with limited reach. In 2026, the reality: third-party cookies are already dead for 40–50% of users. Not theoretically, not planned — now.

What this means in practice

Retargeting pools shrink. Every user who blocks third-party cookies vanishes from your retargeting lists. With Safari users (approximately 25% in the DACH region) and Firefox users (approximately 8%), that is already a third.

Lookalike audiences become less accurate. Less seed data means: the algorithms of Google and Meta have less material to find similar users. Quality declines.

Attribution becomes shorter. When cookies expire after 7 days, no attribution model can assign a purchase that happens 14 days after the first click. The entire purchase cycle is cut short.

CPA rises. Less data means: the algorithms have fewer signals for optimisation. Less optimisation means higher cost per conversion.

The way out: Those who have their own data are independent of platform decisions. Your visitor identity belongs to you. Your engagement data belongs to you. Your CRM data belongs to you. No browser update and no platform change can take that from you.

The 3 stages of a first-party data strategy

Stage 1: Own Visitor Identity (Foundation)

When Safari deletes the _ga cookie after 7 days, you still have your own ID. When Google changes the client ID structure, you still have your own ID. When an ad blocker blocks the GA4 cookie, you still have your own ID.

The technical implementation: a UUID per visitor, stored in a custom first-party cookie. Server-side cookie setting via the SST container gives the cookie a 13-month lifetime — even on Safari. The GA4 Client-ID is stored as a backup in the custom cookie, in case the _ga cookie is deleted. On login, the visitor ID links to the customer ID — giving you deterministic cross-device identity.

Stage 1 implementation checklist:

  • [ ] Custom visitor-ID cookie defined (name, structure, domain)
  • [ ] UUID generation implemented in tracking JavaScript
  • [ ] Cookie is set on first visit (first-party, Secure, SameSite=Lax)
  • [ ] Visit count incremented on every visit
  • [ ] GA4 Client-ID extracted from _ga cookie and stored in custom cookie
  • [ ] SST sets the cookie server-side (HttpOnly, 13-month lifetime)
  • [ ] dataLayer contains visitor_id, visit_count, is_returning on every pageload
  • [ ] On login: customer_id linked with visitor_id

Stage 2: Engagement and Behavioural Data (Enrichment)

Not just who was there, but how engaged — and across sessions. A visitor who swipes through 5 product images, spends 3 minutes on the page, and opens a product description has a different purchase intent than someone who bounces after 10 seconds. Without engagement data, both are equal to Smart Bidding.

The engagement score (0–100) per session quantifies this difference. Cross-session product interest stores which products a visitor viewed across multiple visits. Returning product view flags detect when someone views the same product a second time. Cart abandonment signals capture when someone fills the cart but does not purchase.

Stage 2 data inventory:

SignalStorageDurationUsage
Engagement ScoredataLayer → GA4SessionAudiences, Bidding
Products Viewed (Session)sessionStorageSessionFunnel analysis
Products Viewed (Lifetime)localStoragePersistentCross-session retargeting
Returning Product ViewlocalStoragePersistentUrgency messaging
Cart AbandonmentdataLayer → GA4EventEmail trigger, retargeting
Scroll DepthdataLayer → GA4EventContent optimisation
Active TimedataLayer → GA4EventEngagement segmentation

All data flows as GA4 custom dimensions into audiences. The audiences are imported into Google Ads. This gives Smart Bidding not just "was there" and "bought" as signals, but a spectrum in between. The knowledge article on tracking infrastructure shows the audience strategy with score ranges in detail.

Stage 3: CRM Integration and Predictive Audiences (Scaling)

The last mile: connecting online behaviour with offline data. Those who achieve this have a sustainable competitive advantage that competitors cannot copy — because they do not have the same data.

Customer Match: Upload CRM emails as audiences to Google Ads and Meta. Google and Meta match the hashed emails with their own user data and create high-quality lookalike audiences. Quality: significantly better than pixel-based lookalikes, because the seed data comes from actual purchasers.

Predictive Audiences in GA4: "Likely to purchase in 7 days" — GA4 automatically creates audiences based on machine learning. Requirement: sufficient purchase volume (over 1,000 purchases in 28 days). For shops with less volume: the manual score-based audiences from Stage 2 are the alternative.

CLV-based bidding strategy: High-CLV customers receive higher bids. A customer with 2,000 EUR annual revenue justifies an acquisition investment that a one-time customer at 30 EUR does not. Requirement: CLV per customer must be calculable (from Shopify or CRM).

Email segmentation by engagement score: Hot leads (score above 60) receive different email flows than casual browsers (score below 20). Higher relevance, higher open rates, more revenue from owned media.

RFM Analysis (Recency, Frequency, Monetary): Automated customer segments based on purchase behaviour. VIP customers, at-risk churners, new customers with potential. Each segment receives its own communication strategy.

Stage 3 readiness assessment:

  • [ ] CRM or email tool available (Klaviyo, Mailchimp, ActiveCampaign, etc.)
  • [ ] Customer list with over 1,000 email entries
  • [ ] GA4 User-ID matching activated (on login)
  • [ ] Google Ads Customer Match configured
  • [ ] Sufficient purchase volume for GA4 Predictive Audiences (over 1,000 purchases per 28 days)
  • [ ] CLV per customer calculable (from Shopify or CRM)

The privacy framework — What you may and may not do

What you may do with first-party data

  • Set own cookies (with consent)
  • Collect anonymous engagement data (scroll depth, active time — without PII)
  • Use hashed email addresses for Customer Match
  • Cross-session tracking on your own domain
  • Server-side cookie setting for longer lifetimes (13 months with consent)
  • Use CRM data for segmentation and personalisation

What you may not do

  • Store or transmit PII in plain text (always hash with SHA256)
  • Collect data without consent (Consent Mode defaults must be set to "denied")
  • Share data with third parties without legal basis (data processing agreement or explicit consent)
  • Track users across domains without explicit consent

First-party data compliance checklist

  • [ ] Consent Mode v2 correctly implemented
  • [ ] PII is SHA256-hashed before transmission
  • [ ] Privacy policy lists all custom cookies with purpose and duration
  • [ ] Cookie lifetimes are documented and limited (maximum 13 months)
  • [ ] Customer Match uploads use hashed emails
  • [ ] CRM data processing has legal basis (contract or legitimate interest)
  • [ ] Deletion requests are technically feasible (GDPR Art. 17)

The complete legal foundations are in the GDPR tracking guide.

ROI of a first-party data strategy — What it delivers

Short-term (month 1–3)

Own visitor identity delivers 13-month attribution instead of 7 days. That means: 10–20% more attributed conversions, because users who return after 14 or 30 days are attributed to the correct channel. Engagement scoring enables smarter retargeting: 10–15% lower CPA on retargeting campaigns, because budget is concentrated on hot leads rather than casual browsers.

Medium-term (month 3–12)

Cross-session data accumulates. Audiences become more accurate because they are based on real behaviour over weeks and months rather than a single visit. Broad Match and Smart Bidding benefit directly because conversion signals are more complete. Customer Match audiences deliver higher-intent lookalikes — better prospecting performance at the same budget. Email segmentation by engagement increases open and click rates — more revenue from owned media without additional ad costs.

Long-term (12+ months)

Your own data pool as an asset. Independent of platform changes, browser updates, and legislation amendments. CLV-based bidding makes acquisition more profitable — not just cheaper, but focused on the right customers. Predictive audiences enable proactive marketing instead of reactive retargeting.

What it is not

A quick fix. First-party data is a strategy that builds over months — but then grows exponentially in value. Every month adds data, every audience sharpens, every segment becomes more precise. After 12 months you have a data pool that competitors cannot copy.

The 90-day plan — From zero to your own data strategy

Day 1–30: Lay the foundation

  • [ ] Implement Consent Mode v2 correctly (or custom CMP)
  • [ ] Bring consent rate above 75%
  • [ ] Implement own visitor identity (custom cookie, UUID, visit count)
  • [ ] Set up SST (if not already present)
  • [ ] Activate Enhanced Conversions (Google and Meta)
  • Milestone: Tracking coverage above 80%

Day 30–60: Enrich data

  • [ ] Implement engagement scoring (scroll, time, interactions)
  • [ ] Activate cross-session product interest (localStorage)
  • [ ] Set up cart abandonment signal
  • [ ] Create GA4 custom dimensions for all signals
  • [ ] Define first audiences in GA4 (hot leads, cart abandoners, product comparers)
  • Milestone: 5+ active audiences in GA4

Day 60–90: Connect and scale

  • [ ] Import GA4 audiences into Google Ads
  • [ ] Differentiate bidding by audience segments
  • [ ] Set up Customer Match with CRM emails
  • [ ] Segment email flows by engagement score
  • [ ] Set up CLV calculation from Shopify or CRM
  • [ ] First performance analysis: before/after ROAS comparison
  • Milestone: Measurable ROAS improvement, own data pool growing

Conclusion

Google and Meta will have less and less data about your customers. Browser makers are tightening restrictions. Legislators are tightening rules. Every quarter, another piece of visibility disappears — for everyone building on third-party data.

The shops that build their own data will have an enormous competitive advantage in 2–3 years. Not because they spend more on advertising, but because their advertising optimises on better data. Same ad spend, better results, lower CPA, higher customer lifetime value.

This is not a data protection project and not an IT project. This is the most important strategic investment for your online business. And it is not rocket science: own cookie, engagement scoring, CRM integration. Three stages, 90 days, measurable results.

This article gives you the complete roadmap. You can start tomorrow. If you want, begin with the 15-point tracking audit — it shows you where you stand today. And if you do not want to do it yourself — we do it for you.

Our service

Tracking & Data Architecture

20–40% of your conversion data is missing. Server-side tracking, Consent Mode v2, 18+ events, and engagement scoring bring it back.

Learn more