Skip to main content

From the docs and crawling this forum, I understand that the user merging process is retroactive, so if after an anonymous user logs in, and are identified with a user_id, the user_id of the anonymous events will be retroactively set. 

 

But I’m having trouble confirming this in practice. If I do the below sequence:

  1. New session from incognito window
  2. Fire an event (page view) as an anonymous user
  3. Log-in (with a user that has had previous activity and is not new)
    • I’ve confirmed this does indeed merge the existing user to the anonymous user via User Lookup page. 

And then I do the following segmentation on Amplitude ID and “_new”, Amplitude New User Event (for context the Jan 23rd date is when we implemented amplitude). 

 

Which shows this user as a “New” user, but they should not be new after the merging process, since the merged User ID was not “new”. 

 

How is this possible? If I extend the above analysis not just for a single Amplitude ID, but for the entire user base, are anonymous users getting counted as New when they shouldn’t be and throwing off my entire “new” vs. “existing” analysis?

 

Separately, but related, if I look at all “New” users and then segment by: 1) User ID = “none” and 2) User ID != “none”, the total count of the segments is larger than if the only segment was all users. How is this possible? Doesn’t this prove that some double counting exists?

 

Below is total new users

 

And then here is total new users segmented by “Anonymous” (User Id = “none”) and Logged-in. 

 

Thanks for posting here @racarman245. Regarding Q1, Amplitude’s merging process is retroactive. The data is merged if the anonymous user logs into their existing account within the same session, and all subsequent events are mapped to that user's existing Amplitude ID. For tracking unique users in Amplitude, I've attached this document here to go further in depth.

Some additional reasons for unexpected results could be issues with event timestamps, deduplication, and user identification. For Q2, you may want to exclude events with a "none" user ID from your analysis entirely. I hope this helps!


Thanks @Jeremie Gluckman ! I referenced that article in my investigation as well. 

Even after re-reading, I’m not seeing how the above screenshot is possible, i.e. how a user is counted as “new” after the retro-active merging (the user shouldn’t only be counted as an existing user).

 

If I am to exclude “none” from analysis, how else can I analyze “new” users that are anonymous vs. not-anonymous (logged-in)? 


Reply