Best Practices to Minimize Data Sent to Branch
Overview
To protect user privacy, users of Branch’s services should not send any data to Branch through our services that are considered personally identifiable information (PII). PII includes, but is not limited to, information such as full name, username, email addresses, and account numbers.
When configuring Branch on a digital property, follow the best practices in this document to minimize the data sent to Branch. Because laws across countries and territories vary, and because the Branch services can be used in many ways, consult legal counsel to confirm the sensitivity of the data you plan to send to Branch and any associated legal obligations.
This document will help you understand:
- How Branch treats different types of data—depending on whether they are categorized as personal data, PII, pseudonymous data, or anonymous data.
- Why it’s important to secure personal data before sharing with Branch or other third parties.
- Treatment of various data parameters across Branch’s services.
PII, Personal Data, Pseudonymous & Anonymous Data
From a privacy perspective, data is often classified into four categories: personally identifiable data, personal data, pseudonymous data, or anonymous data. There are also additional categorizations of data depending on the data protection regulatory scheme, such as protected health information. In general, personally identifiable information includes the more sensitive forms of data that can identify individuals. Unless expressly set out in the permitted data fields below, personal data, PII and other sensitive data should either not be shared with Branch at all or should be pseudonymized or anonymized before being shared with Branch if at all. Below we discuss these different categories of data and some best practices for pseudonymization and anonymization.
What is Personally Identifiable Information?
At Branch, we treat data as personal data or personally identifiable data depending on the laws of the relevant jurisdiction. Generally, Branch interprets Personally Identifiable Information (PII) as information that could be used on its own to directly identify, contact, or precisely locate an individual.
This includes things like:
- email address
- mailing address
- phone number
- precise location (such as GPS coordinates of four decimal places or more)
- full names or usernames
Branch does not collect PII to provide our services and strongly advises customers to ensure they are not sending PII to Branch inadvertently.
What is Personal Data?
Personal data is a relatively broad category of data that was introduced in 2018 under the European General Data Protection Regulation and is any information that relates to an identified or identifiable living individual such as IP address, cookie ID, and advertising identifiers like IDFA and GAID. Branch collects Personal Data from customers to provide our services.
What is Pseudonymous Data?
Pseudonymous data is data that is rendered de-identified via a technique that replaces personal data or PII fields in a data record with pseudonyms so that the data record can no longer be attributed to an end user without the use of additional information. Pseudonymization is accomplished by ensuring that identifying reference information is maintained separately via technical and organizational measures to prevent re-identification to an end user.
What is Anonymous Data?
Anonymous data is not linked to an end user’s device and does not specifically identify an end user in any way nor provide the potential for future identification of that end user. For instance, aggregated data that you view in your Branch dashboard on the number of installs in a given time period is anonymous data.
What is Hashing?
Hashing (also known as obfuscation) is a pseudonymization technique whereby a text string (e.g. personal data or PII field) is transformed into an arbitrary numerical value. Hashing is a useful tool for pseudonymizing personal data and PII before sharing it with third parties for a few reasons:
- Hash values are unique, in that it is extremely unlikely that another text string will produce the same hash value.
- Hashing is deterministic, in that the same text string will produce the same hash value every time.
- Hash values are “one-way”, in that it is not possible in practice to take the hash value and reverse the algorithm to reveal the original text string.
Branch utilizes automatic hashing techniques to minimize the personal data we process. Additionally, to the extent you utilize PII in your app’s data records, you should ensure that you hash any such PII before sharing that data with your third-party service providers like Branch to practice data minimization.
Standard Hashing Algorithms
There are several standard hashing algorithms you can use to encrypt personal data and PII and securely pass it to Branch and other third parties, such as:
- MD5 – Produces a 128-bit (16-byte) hash value, typically rendered as a 32 digit hexadecimal number.
- SHA-1 – Produces a 160-bit (20-byte) hash value, typically rendered as a 40 digit hexadecimal number.
- SHA-256 – Produces a 256-bit (32-byte) hash value, typically rendered as a 64 digit hexadecimal number.
Hashing Example
Say, for example, your end-user has provided you with their email address – [email protected] – which you recognize to be PII and therefore seek to hash it before sending this data to third-party service providers like Branch.
Customer Email Address | [email protected] |
MD5 Hashed Value | 6a6c19fea4a3676970167ce51f39e6ee |
SHA-1 Hashed Value | fd9c796f4269b3484f9ef436627d0d1cb35071c5 |
SHA-256 Hashed Value | d709f370e52b57b4eb75f04e2b3422c4d41a05148cad8f81776d94a048fb70af |
When hashing your data, keep in mind that capitalization and punctuation matter. [email protected] will produce a different hash than [email protected]. The same is true for including parentheses, spaces, or dashes in phone numbers.
Data at Branch
As part of Branch’s guiding Privacy Principles, we practice data minimization, which means that we avoid collecting or storing information that we don’t need to provide our services. The personal data that we collect is limited to data like advertising identifiers, IP addresses, and information derived from resettable cookies. In addition, we take a variety of measures to make sure personal data like this isn’t stored in raw form in our systems for longer than we need it to provide our services.
Data Branch Automatically Hashes
Branch has implemented automated hashing mechanisms for data parameters that are expected to or could reasonably include personal data. The following data parameters are automatically salted and sha256 hashed per Our Data Retention Policy and stored only in this hashed form thereafter:
- user_data_aaid - The Google Advertising ID associated with the user.
- user_data_android_id - The Android ID associated with the user.
- user_data_browser_fingerprint_id - The web browser fingerprint ID associated with the user.
- user_data_randomized_device_token - The device fingerprint ID associated with the user.
- user_data_developer_identity - The developer-specified identity associated with the user.
- user_data_geo_lat - Latitude, derived from IP address (above), set automatically server side.
- user_data_geo_lon - Longitude, derived from IP address (above), set automatically server side.
- user_data_idfa - The iOS advertising id of the device where the event occurred. specified by the client.
- user_data_idfv - The iOS vendor id of the device where the event occurred. Scoped to a vendor e.g. Facebook: idfv is the same for Facebook and Messenger but different Twitter. specified by the client.
- user_data_ip - The IP address from which the API call tracking the event originated; set automatically server side.
- user_data_persona_id - The ID assigned by Branch’s Persona associated with the user.
- last_attributed_touch_data_custom_fields
- $aaid - The Google Advertising ID associated with the user.
- $idfa - The iOS advertising ID of the device where the event occurred; specified by the client.
- +url - %24idfa and %24aaid fields are appended to the '+url' as a query param. As such they are stripped and hashed and appended back to the '+url' field.
The following fields are also hashed, but after 60 days rather than 7 days. In the case of JSON fields, the listed key-values are hashed rather than all the data in the top-level JSON object).
- last_attributed_touch_data_plus_referring_domain
- last_cta_view_data_plus_referring_domain
- user_data_http_referrer
- install_activity_touch_data_plus_referring_domain
- install_activity_touch_data_additional_data_plus_referring_domain
- reengagement_activity_touch_data_plus_referring_domain
- reengagement_activity_touch_data_additional_data_plus_referring_domain
- last_attributed_touch_data_custom_fields, install_activity_touch_data_additional_data_custom_fields, & reengagement_activity_touch_data_additional_data_custom_fields
- +referrer
- +url
- $original_url
- $fallback_url
- $fallback_url_xx
- $canonical_url
- $desktop_url
- $ios_url
- $ipad_url
- $android_url
- $samsung_url
- $huawei_url
- $windows_phone_url
- $blackberry_url
- $fire_url
- $ios_wechat_url
- $android_wechat_url
- $android_deeplink_path
- $ios_deeplink_path
- $deeplink_path
- $og_image_url
- $og_video
- $og_url
- $twitter_image_url
- $og_image
- +apple_search_ads_attribution_response
- email_address
- referral_email
- emailId
- userEmail
- referrerEmail
- pw
- password
PII Best Practices
To protect user privacy, Branch policies strongly discourage you from passing us any data that we could recognize as personally identifiable information (PII).
Branch only collects (and soon thereafter hashes) the Personal Data listed above.
Branch does not intend to collect any of the following PII:
- Email addresses
- Account numbers
- Mailing addresses
- Full names or usernames
- Protected Health Information protected by HIPAA
This is by no means an exhaustive list - if the specific data parameter is not the list in our Privacy Policy and you believe it to be PII or otherwise sensitive data, chances are you shouldn’t be sending it to Branch without at least anonymizing or pseudonymizing it first.
Please follow the best practices outlined below to reduce the risk of sending this PII to Branch.
Do Not Send Raw PII to Branch
- Do not send raw PII to Branch via any data parameters you configure to collect information.
Hash PII Before Sending to Branch
- If you do need to send PII through Branch, you must hash it before sending the data to Branch; ideally using the sha256 hashing function.
- Branch does not need to collect the PII fields mentioned above for our services to work, and Branch does not support dedicated parameters for you to pass their hashed versions. If you need to send these hashed fields through Branch, you can do so by passing them into a custom data object via API or SDK. You should work with your account team to make sure this is done correctly before sending this data to Branch.
"custom_data": {
"user_phone_sha256": "15e2b0d3c33891ebb0f1ef609ec419420c20e320ce94c65fbc8c3312448eb225",
"user_email_sha256": "d709f370e52b57b4eb75f04e2b3422c4d41a05148cad8f81776d94a048fb70af"
},
Developer Identity
- Developer Identity: Branch customers can optionally set a custom parameter called Developer Identity by calling the .setIdentity method on our SDKs (Android, iOS, Web). Generally this is done when a user logs in and should represent some sort of non-PII user ID.
- If you plan on using your own user identifier to join your offline data with Branch’s data, there are a few things to keep in mind when choosing what value to use as your Developer Identity.
- You may NOT use an identifier for Developer Identity that contains personally identifiable information (PII). This rules out email addresses, user logins, social security numbers, phone numbers, or any piece of data that is deemed to be "PII".
- You CAN use non-obfuscated alphanumeric database identifiers that you might create for your visitors. Another acceptable option is to pass to Branch a hashed or encrypted identifier that is based on PII but that is not Protected Health Information (as defined under HIPAA), as long as you use properly strong hashing or encryption. Branch recommends the use of a strong hashing algorithm and a salt.
- If you want to join Branch data back to your own unhashed internal identifiers, we recommend creating your own internal reference table that matches the raw PII or personal data with its hashed counterpart. Doing so will enable you to line up exported historical Branch data with your own internal databases without sensitive, unhashed data leaving the relative safety of your own systems.
PII in URLs
- URLs: Email is often used for verification as part of site registration and sign-up processes, or in the context of password resets or newsletter signups. These verification emails can sometimes inadvertently include PII in the confirmation/registration link. For instance,
site.com/confirm?email=sample%40email.com&token=413203
. - If the confirmation page’s URL contains PII and the page hosts Branch’s Web SDK, PII could accidentally end up in Branch’s systems as part of the URL string.
- Check links for PII: Sign up for an account. Check if the URL in the verification/confirmation email includes the email address or other PII. As a best practice, you should never include raw PII in URLs under any circumstances. Instead, hash or encrypt this PII or use a safer, pseudonymous identifier instead.
PII in Branch Link Data (Including Query Parameters)
- As outlined above, it is against Branch’s policies to include raw PII in Branch link data. This includes adding PII as query parameters to an otherwise “clean” Branch Link, since Branch will store the full URL
- For example, imagine a Branch link with a raw email appended as a query param, like this:
https://www.test.app.link/abc123?email=sample%40email.com&token=413203
. Branch will store the full URL, including the unwanted PII, in our systems when a user clicks on this link. - As a best practice, make sure you’re not including raw PII in Branch link data (even as query parameters added after link creation) by hashing or otherwise pseudonymizing it before appending it.
HIPAA Disclaimer
- Unless otherwise specified in writing by Branch, Branch does not intend uses of the Branch Services to create obligations under the Health Insurance Portability and Accountability Act, as amended, (“HIPAA”), and makes no representations that the Branch Services satisfy HIPAA requirements. If you are (or become) a Covered Entity or Business Associate under HIPAA, you may not use the Branch Services for any purpose or in any manner involving Protected Health Information unless you have received prior written consent to such use from Branch.
Updated 5 months ago