Painting Cats

August 4, 2020 · 7 min read

Suppose you have a row of N cats, numbered from 0 to N - 1. Initially, all the cats are painted black. You are asked to perform Q operations of the following two types, in random order. 1.Painting Cats: Given a range a to b(inclusive), cats within the range are painted with a new color. Example input:

Type of operation | Start index | End index | Color

1 0 10 blue

No output (void)

2.Query Cat: Given a particular number a, return the color of the cat numbered a. Example input:

Type of operation | number

2 10

Example output:

blue

It is guaranteed that 1 ≤ N ≤ 1,000,000,000 and 1≤Q ≤ 300,000. Create a time and space efficient solution.

I could not answer this question without a hint given by my classmate. Not a difficult question to understand, so feel free to think of a solution if you can, before reading on.

Question Analysis

The difficulty lies in the time and space complexity requirements. Let's ignore them for a while and come up with a naive solution.

Naive approach

One possible way to solve the problem is to simply do as told. Initialize an N sized array and assign every slot to be of the color blue. When handling paint-cat operation, simply loop over the array from the starting index to the end index, updating the existing color to the new color. During query-cat operation, simply index to the slot in the array and output the color in that slot.

There are a few issues with the above naive approach. One being that N could be too large to handle. This means our array will take out substantial space to solve this problem. Besides, the time complexity of the paint-cat operation is O(N) because we touch every cat within range per operation. The upside to this solution is that the query-cat operation is O(1) thanks to the array indexing. In summary:

Space:

O(N) array setup

Time:

O(N) per paint-cat, worst case O(QN), Q operations of paint-cat
O(1) per query-cat, worst case O(Q), Q operations of query-cat

Getting better

Looking at the limiting factors, realize that we cannot represent every single cat, or else we will have trouble making it space-efficient. As for the two operations, we need sublinear time complexity for both. There are a few choices, in terms of data structure, that can achieve sublinear complexity: ADTs:

Map (implemented by Hashtable):

O(1) insert, O(1) update, O(1) query

Priority Queue (implemented by Binary Heap)

O(logn) insert, O(n) update, O(logn) query

Ordered Map (implemented by Balanced Binary Search Tree)

O(logn) insert, O(logn) update, O(logn) query

Binary Heap is out since it requires O(n) update in its standard implementation, although it could achieve O(logn) update with an additional Hashtable to keep track of positions. Both Hashtable and bBST are possible. If we need to maintain some sort of order, we have to go with bBST.

Range Update, Point Query

The question, in effect, can be summarized by the above subheading. The key then is to represent the range and find out how we can query points from the new representation.

I thought about it for a long time. Certainly, we are employing a strategy just like using cones to mark out boundaries in a football field. A range of cats can be represented by the starting index and the ending index. To query a cat, we will look towards its left or right, till we find a demarcating cat. The approach is in the right direction. What puzzled me was how to update cats so that the boundaries are set correctly and point-query can be done properly.

Suppose we update the color of the starting index and the ending index cat. A few runs of operations are described as follows:

(10 cats)

starts with default color: Cat 0 => red, Cat 9 => red

Type of operation | Start index | End index | Color

1 2 5 blue

Cat 0 => red, Cat 2 => blue, Cat 5 => blue, Cat 9 => red

Now, if I employ the strategy where querying the cat will look for the nearest cat on its left that is colored, I can get away with the following queries

2 1 // Cat 0's color is red, hence Cat 1 is red

2 4 // Cat 2's color is blue, hence Cat 4 is blue

However, querying cats 6,7,8 will result in wrong output

2 7 // Cat 5's color is blue, hence Cat 7 is blue (WRONG)

There is also another issue with updating ranges that are overlapping:

// continuing the example above 1 1 3 green

Cat 0 => red, Cat 1 => green, Cat 2 => blue, Cat 3 => green, Cat 5 => blue, Cat 9 => red

Now that is a mess.

Solution

In the end, I needed two intuitions to fix the above issues.

When querying, always look at the nearest colored cat on its left.
When updating, ensure that the cats are painted in such a way that statement one will always return the correct output. In particular, updating a range also breaks down any previous ranges into separate ranges.

Breaking down the steps:

Initialize an ordered map implemented by AVL tree (or any other balanced binary tree), in Java, use TreeMap.
Insert index 0 and color red to mean all cats started with red.
Per query, retrieve the successor of the given index in O(logn) and output the color.
Per update, query(end index + 1) and insert end+1 index with the returned color. Clear the items in the tree that is within the updating range. Insert the starting index with the new color.

Instead of updating both the start index and the end index, we update the start index(new color) and end+1 index(marking the start of an existing color range). Also, we remove colored cats if they are within the new updated range. This way, Space:

O(Q) worst case, all operations are paint-cat, bBST

Time (n being the maximum number of cats in the bBST):

O(logn) per paint-cat, worst case O(Qlogn), Q operations of paint-cat
O(logn) per query-cat, worst case O(Qlogn), Q operations of query-cat

Sample Java solution as follows

import java.util.*;

public class PaintCats {
    public static void main(String[] args) {
        // custom fast io
        Kattio k = new Kattio(System.in,System.out);

        // read input
        int numCat = k.getInt();
        int numQ = k.getInt();

        // init DS
        TreeMap<Integer,String> cats = new TreeMap<>();

        // predefined all cats to be Red
        cats.put(0,'Red');

        // read input
        for (int i=0;i<numQ;i++){
            int queryType = k.getInt();
            if (queryType == 1){ // change color operations
                int start = k.getInt();
                int end = k.getInt();
                String color = k.getWord();

                if (end != numCat){ // if it is the last cat, no need to add in lower limit
                    // get color by looking at the lowerbound,since all the correct lowerbound + color will be
                    // in the treeMap
                    char endColor = cats.floorEntry(end+1).getValue();
                    cats.put(end+1,endColor);
                }

                // remove to save space + speed up look up, safe since start - end is updated here
                cats.subMap(start,end).clear();

                // put in new start
                cats.put(start,color);

            } else { // query color operations
                int catNo = k.getInt();
                // look for the color by checking the lowerbound since all lowerbound are correctly updated
                k.println(cats.floorEntry(catNo).getValue());
            }
        }
        k.close();
    }
}

Visualization

Quick self-reminder: You cannot target id of only digits such as #1, #2 in CSS.

In CSS, identifiers (including element names, classes, and IDs in selectors) can contain only the characters [a-zA-Z0-9] and ISO 10646 characters U+00A0 and higher, plus the hyphen (-) and the underscore (_); they cannot start with a digit, two hyphens, or a hyphen followed by a digit.

P.S. Behind the scene, the visualization does not work exactly as described in the efficient solution above. It is quite a challenge to visualize Balanced Binary Search Tree 😂

Further P.S. Having a hard time making this mobile friendly, view it on wider display, Chrome or FireFox, if you can.

See it on codepen

​

Question Analysis​

Naive approach​

Getting better​

Map (implemented by Hashtable):​

Priority Queue (implemented by Binary Heap)​

Ordered Map (implemented by Balanced Binary Search Tree)​

Range Update, Point Query​

Solution​

Visualization​