How expensive is calling java.util.HashMap.keySet ()?

Question

How expensive is calling java.util.HashMap.keySet ()?

I implemented a sparse matrix as List<Map<Integer,Double>> .
To get all the entries in row i, I call list.get(i).keySet() . How expensive is the call?

I also used a library for an alternative implementation like List<TIntDoubleHashMap> .
What is the cost of calling list.get(i).keys() here?

Do you have any further ideas on how to implement an efficient sparse matrix?
Or can you provide a list of existing implementations in java?

+4

java

user306708 Apr 08 '10 at 13:53

source share

3 answers

Depends on the class that implements List and Map. If you use the List class that implements java.util.RandomAccess (i.e. ArrayList), then the call to get (i) is O (1). If this is a LinkedList, it will be O (n).

- Edited to show the following code snippet (since verdy_p is not read well below and likes to remove the tangent): -

 // In HashMap.java, line 867, JDK 1.6.0.24, how much more // constant time do we want? public Set<K> keySet() { Set<K> ks = keySet; return (ks != null ? ks : (keySet = new KeySet())); }

- end of editing -

A call to keySet () for most map implementations will be constant.

Regarding passing keySet () If you use a Map-backed Map implementation (like HashMap), keySet () relies on entrySet (), which returns the internal iterator supported by the array. So, the iteration of keySet () is O (n).

I would also suggest that this applies to most (if not all) map implementations that are supported by arrays.

For implementations of SortedMap (for example, TreeMap), the iteration on its keys will be akin to iterating through the tree from the lowest to the maximum. This is equivalent to a failed binary search, which is O (n).

Both cases look like O (n). If you use Eclipse, you can look at the code that implements java classes and better understand their complexity.

For classes under java.util.concurrent (e.g. ConcurrentHashMap) you will have to take other considerations to determine how expensive they are.

To expand the bit, if you use a linked list, list.get (i) .keyset () will be O (n). With an ArrayList, this will be O (1). Moving the key set will depend on whether you are using a map with an array (HashMap) or SortedMap (TreeMap). In both cases, the move will be O (n), and the first will be much faster than later, since traversing the array will always be faster than moving along pointers (or links in this particular case of Java).

Now, if you take both list.get (i) .keySet () and iterate over the set, with an implementation of a linked list , which will be O (n ^ 2). Therefore, instead of doing list.get (i) .keySet (), you should use an iterator (see below pseudocode , for clarity, it excludes the general syntax)

This is O (n ^ 2) for lists that do not implement java.util.RandomAccess (e.g. LinkedList):

 for( int i = 0; i < list.size(); i++ ) { Set keySet = list.get(i).keySet(); for( Integer key : keySet.iterator() ) { ... stuff (assuming constant time) ... } }

This is O (n) for the same type of List implementations:

 for( Map m : list.iterator() ) { for( Integer key : m.keySet() ) { ... stuff (assuming constant time) ... } }

+5

luis.espinal Apr 08 '10 at 14:33

source share

It's as cheap as it gets, as it is a look.

From the jdk7 884 source line :

 public Set<K> keySet() { Set<K> ks = keySet; return (ks != null ? ks : (keySet = new KeySet())); }

Trove is probably faster, because unlike the Java Collection Framework, it can work directly with primitives without expensive boxing / unpacking.

+2

polygenelubricants Apr 08 '10 at 14:00

source share

tucuxi · Accepted Answer · 2010-04-08T14:03:05+0000

According to sparse matrices / arrays in Java, the Colt library includes this functionality; plunging into their Javadoc API , it seems true, and time is on.

Also, your implementation does not seem to use column sorting (you only have hash maps in rows). They are made, and optimized for ints and double, as is the case with Trove (but not in the standard Java case, which uses objects with significant overhead). I recommend Colt.

How expensive is calling java.util.HashMap.keySet ()?

More articles: