Hints: Sakurako's Test

Hint 1

There is a standard trick for converting median-based problems into sum-based problems. I learnt it from a recent Codeforces problem: Med-imize.

Notice that $1 \leq a_i \leq n$. So, let’s draw a number line from $0$ to $n$ where $line[num]$ represents the frequency of $num$.

From this number line, how do you locate the median?

Answer to Hint 1 | Hint 2

Let’s define $half = \lfloor (n + 2) / 2 \rfloor$ (integer division), i.e. the 1-indexed position of the median in sorted order.

The median is the first position where the prefix sum of this number line is $\geq half$.
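Here is a minimal Python sketch of that scan, just to make the idea concrete. The function name `median_by_prefix` is my own, and it assumes $1 \leq a_i \leq n$ as stated above.

```python
def median_by_prefix(a):
    """Median of `a`, assuming 1 <= a[i] <= n where n = len(a)."""
    n = len(a)
    half = (n + 2) // 2      # 1-indexed rank of the median in sorted order
    line = [0] * (n + 1)     # line[num] = frequency of num
    for v in a:
        line[v] += 1

    running = 0              # prefix sum of line[0..num]
    for num in range(n + 1):
        running += line[num]
        if running >= half:  # first position whose prefix sum reaches half
            return num
    raise ValueError("array must be non-empty")
```

For example, on `[1, 2, 6, 4, 1, 3]` we get $half = 4$, and the prefix sum first reaches 4 at position 3.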

Now, I’ll show you an alternate way to compute the median.

Define $f(med)$ to be true if the median of the array is exactly $med$. Is $f$ monotonic?

Answer to Hint 2 | Hint 3

That’s a trick question. $f$ is not monotonic. Exactly one value of $f$ is true and all the others are false.

So, let’s modify it a bit. Define $f(med)$ to be true if the median of the array is $\leq med$. Is $f$ monotonic now?

Answer to Hint 3 | Hint 4

Yes, because if the actual median is $actual\_med$, then all $f(i)$ where $i < actual\_med$ will be false and all $f(i)$ where $i \geq actual\_med$ will be true.

Hence, we can binary search on this $f$ to locate the median of an array. Suppose we want to check if the median is $\leq mid$. Then, if the total number of elements in $[0, mid]$ is $\geq half$, $f(mid)$ is true.
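As a rough sketch (not the only way to write it), the binary search on $f$ could look like this in Python. The names `median_by_binary_search` and `pref` are my own, and I assume a non-empty array with $1 \leq a_i \leq n$.

```python
def median_by_binary_search(a):
    """Median of `a` via binary search on f(mid) = (median <= mid)."""
    n = len(a)
    half = (n + 2) // 2
    line = [0] * (n + 1)          # line[num] = frequency of num
    for v in a:
        line[v] += 1

    pref = [0] * (n + 1)          # pref[v] = number of elements <= v
    running = 0
    for v in range(n + 1):
        running += line[v]
        pref[v] = running

    def f(mid):
        # True if at least `half` elements lie in [0, mid]
        return pref[mid] >= half

    lo, hi = 0, n                 # f(n) is always true because pref[n] = n >= half
    while lo < hi:
        mid = (lo + hi) // 2
        if f(mid):
            hi = mid              # median is <= mid, look further left
        else:
            lo = mid + 1          # median is > mid
    return lo
```

It returns the smallest $mid$ for which $f(mid)$ is true, which is the same position the left-to-right scan would find.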

So, even though you could traverse from left to right and report the median at the first point where the prefix sum becomes $\geq half$, I’ve just shown you a binary search technique to locate it. But why? Because this technique will be useful when there are modifications to the array elements. Notice that the monotonicity of $f$ does not change even after modifications.

So now, suppose you are given an $x$ and you want to figure out if the final median can be $\leq mid$. How do you do it?

Answer to Hint 4 | Solution in O(N*Q)

$f(mid)$ will be true only if we can stuff at least $half$ elements into $[0, mid]$. Why don’t we be a bit lenient and stuff everything we can to the left of $mid$ (there’s no harm in having extra elements there)? If, even after this leniency, the count remains less than $half$, we know that the median has to be $> mid$.

Recall Euclid’s division lemma:

$a[i] = q \cdot x + rem$, where $0 \leq rem < x$

Therefore, the leftmost position that we could stuff $a[i]$ into is $rem$, so let’s do it.

For each $a[i]$, increase the frequency of $a[i] \bmod x$. Then, if the prefix sum up to $mid$ is less than $half$, $f(mid)$ is false; otherwise it is true.
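A minimal Python sketch of this per-query check, under the assumptions above. The helper names `can_be_at_most` and `min_median_for_x` are hypothetical, and I simply scan for the first $mid$ where the check passes instead of binary searching.

```python
def can_be_at_most(a, x, mid):
    """f(mid): can the final median be <= mid for this x?"""
    half = (len(a) + 2) // 2
    line = [0] * x                 # line[r] = number of elements with a[i] % x == r
    for v in a:
        line[v % x] += 1
    # prefix sum of residues 0..mid
    return sum(line[: mid + 1]) >= half


def min_median_for_x(a, x):
    """Smallest achievable median for a single query x, in O(n + x)."""
    half = (len(a) + 2) // 2
    line = [0] * x
    for v in a:
        line[v % x] += 1

    running = 0
    for mid in range(x):
        running += line[mid]
        if running >= half:        # first mid where f(mid) becomes true
            return mid
    raise ValueError("array must be non-empty")
```

Calling `min_median_for_x` once per query gives the O(N*Q) solution: each query rebuilds the residue frequencies from scratch.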

Hint 5

Notice that $x$ can be only in the range $[1, n]$. So let’s precompute the answer for all $x$. How do you avoid recomputation of the prefix sum?

Hint 6

Think about the cyclicity of the modulo operation and the sum of the harmonic series.

Hint 6 in more detail

Notice that on the number line, the values modulo $x$ look like

$0, 1, 2, \dots, x - 1, 0, 1, 2, \dots, x - 1, \dots$

If $med < x$, you can only take a window of $med + 1$ consecutive positions starting from every $0$, i.e. the range $[k \cdot x, k \cdot x + med]$ for each multiple $k \cdot x$ of $x$. So, let’s brute force all positions of such $0$ (under modulo $x$), and take the subarray sum over that window starting from each of them.
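Putting the pieces together, here is one possible Python sketch of the precomputation. It builds a single prefix-count array over the original values and, for each $x$, binary searches on $mid$ while summing the windows that start at every multiple of $x$. The names `precompute_answers` and `count_at_most` are my own, not from the problem.

```python
def precompute_answers(a):
    """ans[x] = minimum achievable median for each x in 1..n (ans[0] is unused)."""
    n = len(a)
    half = (n + 2) // 2

    cnt = [0] * (n + 1)
    for v in a:
        cnt[v] += 1
    pref = [0] * (n + 1)              # pref[v] = number of elements with value <= v
    running = 0
    for v in range(n + 1):
        running += cnt[v]
        pref[v] = running

    def count_at_most(x, mid):
        """Number of elements whose residue modulo x is <= mid (0 <= mid < x)."""
        total = 0
        for start in range(0, n + 1, x):   # every position that is 0 modulo x
            end = min(start + mid, n)      # window [start, start + mid], clipped to n
            total += pref[end] - (pref[start - 1] if start > 0 else 0)
        return total

    ans = [0] * (n + 1)
    for x in range(1, n + 1):
        lo, hi = 0, x - 1                  # the median is always a residue in [0, x - 1]
        while lo < hi:
            mid = (lo + hi) // 2
            if count_at_most(x, mid) >= half:
                hi = mid
            else:
                lo = mid + 1
        ans[x] = lo
    return ans
```

Each query is then a lookup of `ans[x]`. The prefix array is built only once, so no per-query recomputation is needed.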

What is the total number of occurrences of such $0$s across all $x$ from $1$ to $n$?
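For reference, here is how I would count them. On the number line $[0, n]$, the positions that are $0$ modulo $x$ are $0, x, 2x, \dots$, so there are $\lfloor n / x \rfloor + 1$ of them, and summing over all $x$ gives a harmonic series:

$$\sum_{x=1}^{n} \left( \left\lfloor \frac{n}{x} \right\rfloor + 1 \right) \leq n + n \sum_{x=1}^{n} \frac{1}{x} = O(n \log n)$$

If each $x$ also runs a binary search on $mid$, that adds another $\log n$ factor, for roughly $O(n \log^2 n)$ in total.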