Sources

What's landed in marts-db, what's pending, and the rules for choosing a source.

Heartbeat reads from every system Satchel runs on. Every source arrives in marts-db via dlt, and every metric is built against landed data — never against a live source on a cockpit refresh. This page is the inventory: what's in, what's blocked, and the source priority rule that keeps the cockpit honest.

Source priority — webbank first

webbank is the source of truth for every metric. Use other landed sources (vss, aml, pdvs, card_management, currency_rate) only when the data webbank exposes is genuinely not there.

This was confirmed by the onboarding team on 2026-04-30, after a metric built on vss_vss.applicant_log diverged materially from what the team counts by hand against webbank.core.client_profile / client_profile_state_log / dict.profile_state. Whatever number we show on the cockpit must match the team's manual count; otherwise the metric loses trust the moment they check it.

When a new metric is proposed via metrics-discovery:

  • webbank first — the skill checks webbank tables before any fallback.
  • fallback requires a notes entry on the metric explaining why webbank could not be used.
  • wrong source → no metric. If the right source is not landed, the source is added to pending sources (with the blocked metric listed under it) and the candidate is parked. We do not ship the metric off a wrong-but-landed source.

Landed sources

All 6 source DBs are landed in marts-db as of 2026-05-04. Total ~15 GB across 9 schemas.

| Source | Role | Notable schemas |
| --- | --- | --- |
| webbank | Source of truth for clients, payments, accounts, KYC. | core (193 t), dict (78 t), scb (20 t), integration (14 t), eapi (5 t) |
| aml | Sanctions / AML decisions (retrospective). | aml |
| pdvs | Payment dispatching / payee verification. | core |
| vss | KYC applicants / client verifications (legacy — being migrated to webbank). | vss, dict |
| card_management | Card issuing / lifecycle. | card, dict |
| currency_rate | FX rates. | rate |

Operationally, as of 2026-05-04:

  • webbank dlt ingest is landed in full: webbank_core 156 tables / 12 GB (entry, payment_state_log, account_balance_log all ~10M rows), webbank_dict 80 tables / 3 MB. Backfill ran 2026-04-30 .. 05-04 in a one-off container on the dlt_data volume.
  • Any onboarding / KYC / payment metric currently sourced from vss_vss.* is a migration target — re-derive against webbank.core.client_profile / client_profile_state_log / dict.profile_state (and the corresponding payment / state-log tables) and verify the number matches the team's hand count before flipping the metric over.
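
A minimal sketch of that pre-flip check, assuming marts-db is Postgres-compatible and reachable with psql; host/user/db below are placeholders in this page's usual <name> style, and the landed table names follow the <source>_<schema> pattern noted above. Real metric logic would also join client_profile_state_log and dict.profile_state; this only shows the shape of the comparison.

    # Placeholders: point psql at marts-db, then compare the legacy-derived count
    # with the webbank-derived count before flipping the metric.
    psql -h <marts-host> -U <marts-user> -d <marts-db> \
      -c "SELECT count(*) AS vss_applicants   FROM vss_vss.applicant_log;" \
      -c "SELECT count(*) AS webbank_profiles FROM webbank_core.client_profile;"
    # Flip the metric only when the webbank-derived number matches the team's hand count.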

Postgres topology

We've gotten the Postgres topology wrong twice already (treating "webbank" as a single DB; treating SEPA as part of the webbank cluster). The picture below is verified against pg_database on the live clusters and against the local schema-only mirror at seed/schemas/. Treat this as the source of truth — when in doubt, re-verify against the live cluster, not against memory.

Cluster A — webbank cluster

Host 16.170.155.54, Postgres 14.2. 9 sibling DBs on one cluster. The same Postgres user reaches all of them with one credential once GRANT SELECT is in place per DB.

| DB | What's in it | SELECT granted? |
| --- | --- | --- |
| webbank | Core banking — accounts, customers, payments, ledger, dictionaries | yes — dashboard_user |
| aml | AML alerts / retrospective | pending |
| pdvs | Payee verification | pending |
| vss | KYC applicants / client verifications | pending |
| card-management | Card issuing / lifecycle | pending |
| currency-rate | FX rates | pending |
| bpm | Workflow / BPM | pending — see pending sources |
| notification | Notification log | pending |
| vmi | (TBD — pulled in dump but role unclear) | pending |

Cluster A is one credential, many DBs. Adding aml/vss/pdvs/etc. is a GRANT request, not a new connection setup.
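
What that GRANT request amounts to, sketched with placeholders in the same <name> style as the rest of this page; the admin role is an assumption (whoever owns the cluster runs it), and dashboard_user is the existing Cluster A credential.

    # Run as a role that can grant on the target DB (cluster_admin is a placeholder).
    psql -h 16.170.155.54 -U cluster_admin -d <db> \
      -c "GRANT CONNECT ON DATABASE <db> TO dashboard_user;" \
      -c "GRANT USAGE ON SCHEMA <schema> TO dashboard_user;" \
      -c "GRANT SELECT ON ALL TABLES IN SCHEMA <schema> TO dashboard_user;" \
      -c "ALTER DEFAULT PRIVILEGES IN SCHEMA <schema> GRANT SELECT ON TABLES TO dashboard_user;"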

Cluster B — SEPA cluster

Separate Postgres, host TBD. 3 DBs, fully independent of cluster A — different credentials, different network reachability.

| DB | What's in it |
| --- | --- |
| mms | SEPA outgoing payments |
| mms_inst | SEPA Instant payments |
| mss_kart | (TBD) |

Cluster B is its own infrastructure ask. Different host, different creds — does not piggyback on the cluster A request.

Local mirror for exploration

DDL only, no data:

  • seed/schemas/prod/ — cluster A dumps (gitignored).
  • seed/schemas/prod-sepa/ — cluster B dumps (gitignored).
  • docker compose -f docker-compose.dev.yml up -d postgres-schemas — container at localhost:5433, all 12 DBs loaded, user explorer / explorer.
  • seed/schemas/erd/webbank-core/index.html — SchemaSpy ERD for webbank.core (186 tables with relationships).
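
Quick exploration against the mirror, using the connection details above:

    # List schemas in a mirrored DB, then the tables of one schema (user / password: explorer / explorer).
    PGPASSWORD=explorer psql -h localhost -p 5433 -U explorer -d webbank -c '\dn'
    PGPASSWORD=explorer psql -h localhost -p 5433 -U explorer -d webbank -c '\dt core.*'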

MVP source plan

The 2026-04-24 offsite produced a 15-source MVP shortlist. We deliberately narrow that to 4 sources for the first end-to-end build, chosen so that every ingest type we'll later need is exercised at least once.

| # | Source | Type | Why this one specifically | Owner | Access |
| --- | --- | --- | --- | --- | --- |
| 1 | webbank Postgres | Postgres replica | Core banking truth. Already wired. | Den | connected |
| 2 | BO API | Internal REST | Back-office state not in webbank tables (operator actions, manual interventions, queues). Exercises the "internal HTTP" pattern without SaaS quirks. | ? | unknown |
| 3 | Zoho CRM | SaaS REST (OAuth) | Funnel truth: leads, deals, pipeline. Exercises the SaaS-OAuth pattern. | ? | unknown |
| 4 | Google Analytics | SaaS REST (service account, query API) | Top-of-funnel: traffic, sources, conversions. Exercises the SaaS-aggregation-API pattern. | Дима (devs.ua) | unknown |

These 4 cover the 4 ingest archetypes we expect across the wider MVP-15 (own-Postgres / internal REST / SaaS OAuth REST / SaaS aggregate API). The remaining 11 (Tx monitoring, Bank providers, Collateral, Fireblocks, Ondato, Freshchat, Zendesk, YouTrack, registration/contact forms, accounting) come after the ingest pattern is proven on these 4.

Pending sources

Source databases that Heartbeat needs but has not yet landed. Each entry names the metrics blocked on it. Until landed, the affected metrics are either removed from the cockpit (file deleted from db/metrics/) or carry an explicit caveat in their notes.

BPM (Business Process Management)

Why we need it. BPM holds the AML compliance review state for outgoing payments. While BPM holds a case for manual compliance review (which can take days), webbank only sees the payment as authorized. Without BPM we cannot:

  • subtract AML hold-time from outgoing-payment processing-time metrics (so the metric currently reads "bank + compliance + random hold" instead of "bank only");
  • show the full AML screening picture — BPM has the breakdown across incoming / monetary / ongoing screening types.

Blocked metrics / metric concepts.

  • outgoing_sepa_credit_processing_time_p50_min_24h — methodology is documented as bank+compliance until BPM lands. PM review 2026-05-05 wants a sibling that is bank-only (AML wait subtracted).
  • outgoing_other_processing_time_p50_min_7d — same.
  • AML Screening Failures (deleted 2026-05-06) — previously aml_failures_24h; only saw incoming AML retrospective screenings. BPM is needed for incoming + monetary + ongoing in one metric.

Status. Not landed as of 2026-05-06.

QAC

Why we need it. QAC is the source of truth for KYC pass/fail according to PM (Alena, 2026-05-05). Webbank's webbank_dict.kyc_verification_status exposes a coarser KYC outcome (CLEAR / AML_FOUND / IN_PROGRESS / ...) that may be useful as a stop-gap, but the team's hand-count and reporting are against QAC.
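
As an illustration of the stop-gap only (not a substitute for QAC): the coarse outcomes can at least be inspected on the landed dict table. The marts-db connection values are placeholders as in the earlier sketches, and any pass rate built this way would still need the join to the client-verification side plus a check against the team's QAC-based hand count.

    # Inspect the coarse KYC outcomes webbank exposes (CLEAR / AML_FOUND / IN_PROGRESS / ...).
    psql -h <marts-host> -U <marts-user> -d <marts-db> \
      -c "SELECT * FROM webbank_dict.kyc_verification_status;"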

Blocked metrics / metric concepts.

  • KYC Pass Rate (deleted 2026-05-06) — previously kyc_pass_rate_30d; was reading the redundant Client Verification surface. Removed rather than left on a wrong source until QAC lands.

Status. Not landed as of 2026-05-06.

SEPA cluster (mms, mms_inst, mss_kart)

Why we need it. SEPA outgoing payments and SEPA Instant payments live on a completely separate Postgres cluster (Cluster B — mms, mms_inst, mss_kart). Any processing-time or volume metric for SEPA rails must read from here, not from webbank.

Blocked metrics / metric concepts.

  • SEPA outgoing payment volume (24h / 7d)
  • SEPA Instant processing time (p50, p95)
  • SEPA rejection rate

What's needed. Cluster B is a separate infrastructure ask — different host, different credentials, different network path from the VM. This is not a simple GRANT like the remaining Cluster A DBs; it requires a separate connection setup and .env entry.

Status. Not landed as of 2026-05-08. Host TBD; pending access credentials from ops.

BO API (internal REST)

Why we need it. The back-office API holds operator actions, manual interventions, and queue states that are not reflected in webbank tables. It exercises the internal-HTTP ingest pattern and gates the full MVP-4 ingest-archetype proof.

Blocked metrics / metric concepts.

  • Operator intervention queue length / age
  • Back-office case resolution time

What's needed. API base URL, auth scheme (token / mTLS), and SELECT-equivalent access on the endpoints covering queue and case state.

Status. Not landed as of 2026-05-08. Owner and access details unknown.

Zoho CRM (SaaS REST / OAuth)

Why we need it. Zoho CRM is the funnel truth — leads, deals, pipeline stage, conversion. Any sales-funnel or pipeline-health metric must read from here. Exercises the SaaS-OAuth ingest pattern.

Blocked metrics / metric concepts.

  • Lead → deal conversion rate (30d)
  • Pipeline value by stage
  • Time-in-stage per deal

What's needed. Zoho OAuth credentials (client ID / secret / refresh token), scope for CRM read access.

Status. Not landed as of 2026-05-08. Owner unknown.

Google Analytics (SaaS aggregate API)

Why we need it. GA is top-of-funnel truth — traffic, sources, conversions. Exercises the SaaS-aggregation-API ingest pattern (query API, not row-level data).

Blocked metrics / metric concepts.

  • Organic sessions (7d / 30d)
  • Conversion rate (visit → signup)
  • Traffic source breakdown

What's needed. Google Cloud service account with GA Data API access; GA4 property ID. Owner: Дима (devs.ua).

Status. Not landed as of 2026-05-08.

notification (Cluster A)

Why we need it. The notification DB on Cluster A holds the notification log — delivery status and channel breakdown for all system-generated messages. Useful for monitoring notification delivery health and debugging communication issues.

Blocked metrics / metric concepts.

  • Notification delivery rate (24h)
  • Failed / bounced notifications

What's needed. GRANT SELECT on notification DB for dashboard_user (same credential as webbank — Cluster A, one request covers it).

Status. Not landed as of 2026-05-08.

vmi (Cluster A)

Why we need it. Role unclear — present in the Cluster A dump but schema not yet inspected. Pending schema review to determine if it contains business-signal data relevant to a metric.
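
The vmi DDL is already loaded in the local schema-only mirror, so the review can start there without any new access:

    # List vmi's schemas on the mirror, then drill into a schema of interest.
    PGPASSWORD=explorer psql -h localhost -p 5433 -U explorer -d vmi -c '\dn'
    PGPASSWORD=explorer psql -h localhost -p 5433 -U explorer -d vmi -c '\dt <schema>.*'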

Status. Not landed as of 2026-05-08. Schema review needed before metrics can be proposed.

How to land a new source

Reach for the dlt engine, not a new script.

  1. Profile the upstream tables and pick the slice we actually need.

  2. Edit api/ingest_lib.py:SOURCES — add a SourceSpec(db=…, schemas=[…], pipeline_name=…).

  3. Add the source's connection env (SOURCE_PG_* for Postgres) to docker-compose.prod.yml and the VM .env (see the sketch after this list).

  4. Backfill in a one-off container on the dlt_data volume:

    ssh ubuntu@13.62.60.156
    cd ~/heartbeat-dashboard
    docker compose -f docker-compose.prod.yml run --rm --no-deps \
      --name heartbeat_ingest api \
      python -m scripts.ingest --source <name>

    Long-running backfills must not run via docker exec heartbeat_api — the api container's writable layer gets wiped on rebuild and a docker compose up -d from a parallel deploy session will kill mid-flight extract/normalize state. The one-off container with dlt state on /var/dlt (env DLT_PIPELINES_DIR=/var/dlt/pipelines) makes the backfill resumable across rebuilds and reboots.

  5. Once the backfill is done, the hourly refresh_all.py cron incremental-loads the source from then on (incremental refreshes take seconds, not hours, so they ride on docker exec heartbeat_api inside the cron path).

  6. Open the metrics-discovery skill against the new schema — candidate file → real number → owner alignment → metric file in db/metrics/.
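
For step 3, the env entries might look roughly like this; the actual SOURCE_PG_* key names are whatever api/ingest_lib.py reads, so every name and value below is a placeholder, not the real convention:

    # Hypothetical .env additions for a new Cluster A Postgres source (all keys/values are placeholders).
    SOURCE_PG_BPM_HOST=16.170.155.54
    SOURCE_PG_BPM_PORT=5432
    SOURCE_PG_BPM_DB=bpm
    SOURCE_PG_BPM_USER=dashboard_user
    SOURCE_PG_BPM_PASSWORD=<from the Cluster A credential>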

Single-flight ingest still required. Even with the t3.large upgrade (2026-05-04), parallel pipelines compete with api / web / marts-db for RAM. One backfill at a time.