Solution to Perm-Missing-Elem by codility

16 Jan

January 16, 2014 Sheng 123

Question: http://codility.com/demo/take-sample-test/perm_missing_elem

Question Name: PermMissingElem or PermMissingElement

The main challenge of this question is the XOR operations: X^X=0, and 0^X=X. Logically, the addition and subtraction operations also are able to do this work. But taking the overflow in computer into consideration, they become a very bad choice.

def solution(A):

length = len(A)

xor_sum = 0

for index in range(0, length):

xor_sum = xor_sum ^ A[index] ^ (index + 1)

return xor_sum^(length + 1)

123 Replies to “Solution to Perm-Missing-Elem by codility”

the.sentinel.ua says:
April 22, 2014 at 7:04 am
Hi Sheng, would you please be so kind to explain in a few more details the main idea behind this solution? I clearly see how it works, but cannot imagine how to come to it (my own solution is pure arithmetic, though the overflow issues gave me some pains at the beginning). Thanks in advance!
Reply
- Sheng says:
  April 22, 2014 at 3:20 pm
  It is the same as your arithmetic solution. You might use addition and subtraction, because x + a – a = x. Here I used the XOR operation, with x XOR a XOR a = x. One great advantage of XOR over addition and subtraction is that, XOR never lead to overflow. As a result, the XOR operation is widely used in encryption.
  Reply
the.sentinel.ua says:
April 23, 2014 at 5:36 am
Finally got how beautiful XOR is used here. I see it as subtraction and addition done in parallel with the idea that if there were no missing element in A then after the loop we would come up with the xor_sum value equal 0. And it does not matter in which order we add/subtract numbers, they all are there (except one, discovered in the last XOR application). Cool. Thanks again 🙂
Reply
- Sheng says:
  April 23, 2014 at 10:30 am
  That is great! It is my pleasure to see my code be helpful!
  Reply

class Solution {

public int solution(int[] A) {

double sum = 0;

for (int i=0; i<A.length; i++)

sum += A[i];

double res = 0.5*(A.length+1)*(A.length+2) - sum;

return (int)res;

}

Sheng says:
July 21, 2014 at 6:44 pm
Another 100/100 solution.
But I do not support it. Float/Double number will lose some accuracy. In theory, if the sum is big enought, and the missing element is small enought, 0.5*(A.length+1)*(A.length+2) will be the same as sum after rounding. And it might give a wrong result in such cases.
PS: if you want to post some code, please use <pre> and </pre>, instead of code tag.
PPS: I removed your comment on Solution to Passing-Cars by codility, because its code is imcomplete and unreadable.
Reply

the.sentinel.au says:
July 23, 2014 at 3:22 am
It won’t…just won’t give a wrong result. It was just issue in java, so I used double instead. It should work perfectly.
Reply
- Sheng says:
  July 23, 2014 at 8:45 am
  It works well for the test. But passing the tests does not mean it is perfect. Bug is a bug, no matter you found it or not. For example, assuming significant digits in double have length of one, 0.9 E10 – 1 = 0.9 E10
  Using int will lead to overflow.
  Using float/double will lead to wrong answer in rare cases.
  Using BigInteger/BigDecimal works. But it should cost much more/much much more time.
  Reply
  - the.sentinel.au says:
    July 23, 2014 at 1:36 pm
    Easy, man. Solution works perfectly for task conditions:
    – N is an integer within the range [0..100,000];
    – the elements of A are all distinct;
    – each element of array A is an integer within the range [1..(N + 1)].
    In general case, you are right. Though if it was an issue, I would simply develop another solution for that.
    Reply
    - Sheng says:
      July 23, 2014 at 2:17 pm
      Sorry! If considering the limitation in the test, your answer is right. I did not notice the range of N.
      Reply
  - the.sentinel.au says:
    July 23, 2014 at 1:41 pm
    By the way, using double in this case is an equivalent of using big integer, as you can see in expression. And could you point me out on that rare cases, that could happen?
    Reply
    - Sheng says:
      July 23, 2014 at 2:51 pm
      I did a quick test on Python like:
      >>> N = 200000000
      >>> sum = 0
      >>> for i in xrange(1, N+1): sum += i + 1
      >>> (N+1)*(N+2) // 2.0 – sum
      0.0
      The testing set is the integers from 2 (inclusive) to 200000001 (inclusive). And the integer 1 is missing.
      Reply
the.sentinel.au says:
July 23, 2014 at 3:52 pm
Yeah, you are right:) Sorry for my awkward and unprofessional answers.
Thank you very much!
I should study more of it:)
Reply
- Sheng says:
  July 23, 2014 at 4:25 pm
  You are studying more, when you are doing these practice. Actually, years ago, when I faced this type of questions at the first time, I made the same mistake. So never mind. Solve it correctly once, and you are not going to make the same mistake again.
  Enjoy it!
  Reply
Dimitri says:
August 3, 2014 at 1:41 pm
Here’s my solution in Python:
1
2
3
4
def solution(A):
    N = len(A) + 1
    sum_N = (N * (N+1)) / 2
    return sum_N - sum(A)

Reply
- Dimitri says:
  August 3, 2014 at 1:43 pm
  @Sheng How can I paste in code so that it’s displayed like in your post above?
  Reply
- Sheng says:
  August 4, 2014 at 12:05 am
  Hello @Dimitri ! Please include your code inside a <pre> … </pre> block, not <code> block.
  This works well with Python. Because Python will automatically change int to long, which could hold as large integer as you want. But for other languages, it may lead to overflow.
  Reply
- Son Thai says:
  October 10, 2015 at 4:37 am
  This solution is hinted by the PDF reading material provided by Codility. see last page.
  Reply
- vi says:
  June 26, 2020 at 1:18 am
  this solution works only on positive integers
  Reply
  - Vi says:
    June 26, 2020 at 1:22 am
    and not with lists including Zero as well.
    Reply
    - Sheng says:
      July 27, 2020 at 11:16 pm
      Question Description: each element of array A is an integer within the range [1..(N + 1)].
      I would say the solution does not need to consider zero or negative numbers.
      Reply
- James says:
  December 7, 2021 at 1:24 pm
  Is this a solution to oddOccurrences? Coz it does not seem to work. I’m using python 2.7.11 by the way (incase that is what is causing the problem)
  Reply

Here’s my solution, a bit different approach. It goes through every permutation cycle it can find in the array and sets the values to 0 along the way. If there is an element that is not zeroed, the solution is its index + 1 (zero-indexed). Otherwise the answer is N+1. It should not consider an element more than two times, so complexity is O(N). Code in C#:

using System;

class Solution

{

public int solution(int[] A)

{

for (int i = 0; i < A.Length; i++)

{

if (A[i] == 0)

{

continue;

}

int n = A[i] - 1;

while (n != -1 && n < A.Length)

{

int next = A[n] - 1;

A[n] = 0;

n = next;

}

for (int i = 0; i < A.Length; i++)

{

if (A[i] != 0)

{

return i + 1;

}

return A.Length + 1;

}

Sheng says:
September 26, 2014 at 2:40 pm
Great! Thanks for sharing your solution!
Reply
Mike says:
January 24, 2016 at 4:37 am
This solution doesn’t pass codility tests.
Reply
- Mike says:
  January 24, 2016 at 4:39 am
  Sorry, I compared incorrect tests.
  Reply

Here is my solution in Java…..

int size = A.length;

int[] B = new int[size+1];

int missing = 0;

for (int i=0 ; i<size ; ++i)

{

B[A[i]-1] = A[i];

}

for (int i=0 ; i<=size ; ++i)

{

if(B[i] == 0)

missing = i+1;

}

return missing;

Sheng says:
October 15, 2014 at 9:14 am
Thanks for sharing your solution! But there are two minor issues. On one hand, space complexity is not O(1). On the other hand, when the first missing is found, you could simply stop searching and return it.
Reply
- Idan says:
  October 15, 2014 at 9:50 am
  Yes you are right… I added the “break;” later and didn’t find a way to edit the post… I did got 100% for this solution though…
  Reply
  - Sheng says:
    October 15, 2014 at 9:53 am
    The complexity estimation is not 100% accurate. Your solution demos the counting sort.
    Reply

David says:
October 16, 2014 at 7:52 am
Hi! My solution is in C. The main idea is to compare the ideal sequence 1..N+1 with the given array. But with a whole sum, it is overflowed. So, I try accumulating the differences between ideal sequence and array for each position. In that way the number holds smaller and no overflows.
1
2
3
4
5
6
int solution(int A[], int N) {
    long missedNum = 0, i;
    for(i=1; i<=N;i++)    missedNum += i - A[i-1];
    missedNum += N + 1;
    return (int)missedNum;
}

Reply
- Sheng says:
  October 16, 2014 at 9:22 am
  Overflow is still possible. Consider a big array, with the first three element as
  maxInt-1, maxInt-2, maxInt-3, ……, 3, 2, 1
  Reply
  - David says:
    October 16, 2014 at 10:09 am
    jeje! Yes, you are right, I have also seen, but using the premises of the problem “N in range ([0..100,000])” and using a long int, the probability was low…Any way, is not a safe solution. Any idea without using XOR?
    Thank you Sheng
    Reply
    - Sheng says:
      October 16, 2014 at 10:14 am
      You are more than welcome 🙂 XOR is the best choice as far as I see. If you really hate XOR, BigInt in Java and integer in Python might be a choice.
      Reply
  - 20150524 says:
    May 24, 2015 at 9:35 am
    There should be no overflow since the maximum N of the task is 100000 instead of maxInt. According to David’s code, if the array A with size 100000 is in reverse order (ie. [100001, 100000, …, 2]), the minimum value of missedNum can be reached should be -2500050000 ((100001+50002)/2*50000-(1+50000)/2*50000), which is larger than the minimum value of long.
    Reply
    - Sheng says:
      May 31, 2015 at 11:32 pm
      Yes, you are right. But traditionally and pratically, XOR is better and more widely used. No mather how larger N is, XOR works perfectly.
      Reply
Claudiu says:
November 14, 2014 at 8:20 am
Sheng, your solution doesn’t work with codility. It compiles but it fails on some tests:
Example test: [4, 1, 3, 2]
WRONG ANSWER (got 5 expected 1)
Example test: [4, 1, 3]
WRONG ANSWER (got 2 expected 0)
Reply
- Sheng says:
  November 14, 2014 at 11:45 am
  I tired minutes ago, and it passed all the test.
  Are you running the solution in another challenge? “The array contains integers in the range [1..(N + 1)]”. How could the expected answer be 0?
  Reply
Kim says:
December 20, 2014 at 6:31 pm
Stuck with the overflow, I saw this article and spent few hours studing about the XOR operations. For your convenience, here goes a short description of what I found:
First, let’s take a look at the properties of XOR.
1. XOR is commutative, which means
X ^ Y = Y ^ X.
2. XOR is assiciative, i.e.,
X ^ (Y ^ Z) = (X ^ Y) ^ Z.
3. For the truth values, as in the article,
X ^ X = 0, 0 ^ X = X.
With this properties, we can adapt a summation starategy similiar to the one of algebraic operations.
Now we will assume a set [1, …, N + 1] (or an array), and evaluate the sum S of the elements by XOR operation. Then,
S = 1 ^ 2 ^ … ^ N ^ (N +1) ^ 1 ^ 2 ^ … ^ N ^ (N +1).
This might look wierd, but it has a useful charaterictics, that is;
S = 1 ^ 2 ^ … ^ N ^ (N +1) ^ 1 ^ 2 ^ … ^ N ^ (N +1)
= 1 ^ 1 ^ 2 ^ 2 ^ … N ^ N ^ (N + 1) ^ (N + 1)
= 0.
Back to our problem, we have an array A[] having N elements in [1, …, N + 1] and a missing elelent. Let the missing element M, then
A[1] ^ A[2] ^ … ^ A[N] ^ M = 1 ^ 2 ^ … ^ N ^ (N +1),
because any permutation in the array eventually holds all elements.
Now the preperation for the answer is almost complete. To find M, we will modify the sum slightly.
S = 1 ^ 2 ^ … ^ N ^ (N +1) ^ 1 ^ 2 ^ … ^ N ^ (N +1)
= A[1] ^ A[2] ^ … ^ A[N] ^ M ^ 1 ^ 2 ^ … ^ N ^ (N +1)
= (A[1] ^ A[2] ^ … ^ A[N] ^ 1 ^ 2 ^ … ^ N) ^ M ^(N +1)
= (A[1] ^ 1 ^ A[2] ^ 2 ^ … ^ A[N] ^ N) ^ M ^ (N + 1).
Since S = 0, we can get M as
M = S ^ (A[1] ^ 1 ^ A[2] ^ 2 ^ … ^ A[N] ^ N) ^ (N + 1)
= (A[1] ^ 1 ^ A[2] ^ 2 ^ … ^ A[N] ^ N) ^ (N + 1).
And finally, this is my personal understanding of the solution and I do not garantee its correctness. Thanks for reminding the XOR operations.
Reply
- Kim says:
  December 20, 2014 at 6:37 pm
  I forgot to mention that the array index shown in the text starts with 1, not 0. Sorry for that.
  Reply
- Sheng says:
  December 20, 2014 at 7:07 pm
  Your explanation is more than awesome! It should be greatly helpful to the others.
  Thanks for your contribution!
  Reply
- rick says:
  December 19, 2015 at 9:36 pm
  Thanks so much for this. It very likely saved me hours of trying to figure it out myself.
  Reply
Kim says:
December 21, 2014 at 11:21 pm
For those who prefer arithmetic solution to XOR (including myself :), I thought a little more on the behavior of XOR.
Assume a set [1, …, N + 1], an array A[], and a missing element M, as previously.
The arithmetic solution to the problem is very intuitive and seems much preferable to many of us. That is,
M = (1 + 2 + … + N + (N + 1)) – (A[1] + A[2] + … + A[N]),
which came from
(1 + 2 + … + N + (N + 1)) – (A[1] + A[2] + … + A[N] + M) = 0.
The reason I was uncomfortable with XOR is this. When I add or subtract some number, I can see certainly what I am doing, just like addition is DOing something and subtraction is UNDOing something. But in XOR operations, things didn’t look so clearly.
However, I found that XOR solution is basically not different from arithmetic one. In fact, we can almost consider XOR as normal arithmetic operation like this:
r = p ^ q <=> r = p + q (mod 2).
What about the subtraction? In mod 2 system, subtraction is same as addition because
p – q = p – q + 2q = p + q (mod 2).
Don’t be confused by the modulo system (just like exactly what I did), because XOR operation is essentially bit-wise operation.
After all, if we consider addition and subtraction as DOing-UNDOing something in arithmetic operations, the same happens in XOR operations. We do something by XORing(adding) and also undo something by XORing(subtracting at this time).
Now we can see M in the XOR way. Like before, to get M, we will subtract the sum of the array A[] from the sum of the set [1, …, N + 1]. Then
M = sum_of([1, …, N + 1]) minus sum_of(A[]),
which becomes
M = (1 ^ 2 ^ … ^ N ^ (N + 1)) ^ (A[1] ^ A[2] ^ … ^ A[N]).
If you need M as the form described in the article, we can rearrange it like this as we did before;
M = (A[1] ^ 1 ^ A[2] ^ 2 ^ … ^ A[N] ^ N) ^ (N + 1).
PS: Sorry for my poor english. It’s not my mother tongue.
Reply
- Sheng says:
  December 21, 2014 at 11:22 pm
  Thanks! You did an awesome work!
  Reply
  - Julek says:
    July 24, 2015 at 11:38 am
    Hi Sheng!
    I can’t get the idea of using XOR in this task. If we have, let’s say, 5^4^7^6^5 what is DOing and what is UNDOing something. I know how XOR works on bits but don’t get the idea of summing something with XOR. We have 1^2 = 3 so something like adding then 3^3 = 0 so subtracting?
    Reply
    - Julek says:
      July 25, 2015 at 7:04 am
      Sorry for spamming. I have already figured it out. The solution is hard to understand. I always come here to see your solutions after submiting mine. I can always learn something new.
      Reply
      - Sheng says:
        August 12, 2015 at 12:14 am
        That fine! Thanks for visiting!!!
        PS: it is easier if you had some experience in cryptography。
    - Sheng says:
      August 12, 2015 at 12:12 am
      Hello Julek,
      It is actuallly nothing about adding/subtracting. In this problem, in range 0 to N – 1, we have input[i] = i + 1, except one number (to say M) is missing. The M’s index should be M – 1.
      When we XOR every (index + 1) and its value, we are doing (the order does not change the XOR result):
      input[0] XOR 1 XOR input[1] XOR 2 XOR … XOR input[N-1] XOR N
      that is (with nput[i] = i + 1):
      1 XOR 1 XOR 2 XOR 2 XOR 3 XOR 3 XOR … XOR N XOR N
      Because any for any number X, X XOR X = 0, and X XOR 0 = X. The final result of previous equation is:
      0 XOR 0 XOR 0 … XOR M XOR … XOR 0 = M.
      Finally, we get the missing number M.
      Reply

Hello,
Ca you please tell why my solution giving just 40% correctness though its 80% in Performance.

def solution(A):

A = sorted(A)

if len(A) == 0:

return 1

a = A[0]

b = A[-1:][0]

A_sum = sum(A)

b_sum = (b*(b+1))/2

if a > 1:

a_sum = ((a-1)*((a-1)+1))/2

b_sum = b_sum - a_sum

number = b_sum - A_sum

if number == 0:

return b+1

return number

Please suggest?

Sheng says:
January 24, 2015 at 1:48 am
First of all, thanks for visiting my blog. But sorry I cannot help you to debug.
Some suggestions are here:
1. b = A[-1:][0] has a better form: b = A[-1], quick and simply.
2. No need to sort the input in O(NlogN). Use b = max(A) instead.
3. Your solution easily leads to overflow of A_sum and b_sum in most programming languages. Python could handle with big integers well. But the operations on big int is expensive.
Reply

I solved it in C++ with another solution, using a logic vector instead. I think its an advantage, because no big numbers are possible. But it uses two for loops-one for filling the logic and one to analyse it.

int solution(vector A) {

// write your code in C++11

vector<bool> B(A.size()+2, false);

for(unsigned int i =0; i < A.size(); i++)

{

B[A[i]] = true;

}

for(unsigned int i =1; i < B.size(); i++)

{

if (B[i] == false) return i;

}

return -1;

}

Sheng says:
January 27, 2015 at 9:26 pm
If the next time you got some error in submission, please contact me directly (https://34.145.67.234/contact-me/). I also fixed one little warning in your code 🙂
Thanks for sharing and enjoy coding!
Reply

Kin Cheung says:
February 3, 2015 at 4:56 am
After I have come up with this version, I could totally understand how you came up with the XOR version as Kim explained it.
(starting to write algo in Python. What a handy tool it is.)
1
2
3
4
5
6
def solution(A):
    N = len(A)
    mysum = 0
    for i in range(0, N):
        mysum += ((i + 1) - A[i])
    return mysum + (N + 1)

Reply
- Kim says:
  February 6, 2015 at 1:59 am
  Thanks for your sharing! Another 100/100 solution. But, as a matter of fact, your code is basically same as David said earlier, hence same comments on the overflow hold.
  Reply
  - Sheng says:
    February 27, 2015 at 12:25 am
    Python can manage the big integer automatically. So there is no overflow in this piece of code. However, if the integer is too big, Python needs lots of memory and it is time-consuming.
    It is strongly recommended NOT to use this solution.
    Reply
    - Kin says:
      July 27, 2015 at 2:38 am
      Thank you for the comment!
      Reply

@Sheng Thanks.
JavaScript

function solution(A) {

// initial xor_sum

var xor_sum = 0;

for(var index = 0, len = A.length; index < len; ++index){

xor_sum = xor_sum ^ A[index] ^ (index + 1) ;

}

return xor_sum ^ (A.length + 1);

}

Sheng says:
March 4, 2015 at 12:46 am
You are more than welcome! 🙂
Reply

This is my solution in Pascal (score: 100 of 100):

function solution(A: array of longint; N: longint): longint;

var

i,suma,x:longint;

begin

x:=0;

suma:=0;

for i:=1 to N+1 do

x:=x+i;

for i:=0 to N-1 do

begin

suma:=suma+A[i];

end;

solution:=x-suma;

end;

Sheng says:
March 13, 2015 at 11:59 pm
Probably the first Pascal solution in my blog. Thanks!
Reply
- calinutz says:
  March 14, 2015 at 12:01 pm
  I am planning on posting more solutions in pascal. I just discovered your blog, and I like it :), so you’ll be seeing other posts from me. I am going to try to find the right pascal code for each of your python solutions
  Reply

Pedro Ricardo Garcia says:
March 13, 2015 at 12:47 pm
1
2
3
4
5
6
7
8
9
def solution(a)
    a.sort!
    a.each_index do |index|
        if a[index] != index+1
            return index+1
        end
    end
    a.size+1
end

Reply
- Sheng says:
  March 14, 2015 at 12:04 am
  Oh~ Could you tell me what is the programming language? Thanks!
  Reply
  - satoshi says:
    June 16, 2015 at 9:48 am
    Ruby
    Reply
    - Sheng says:
      June 17, 2015 at 10:29 pm
      Thank you!
      Reply

100/100 solution

// you can use includes, for example:

// #include <algorithm>

// you can write to stdout for debugging purposes, e.g.

// cout << "this is a debug message" << endl;

#include <algorithm>

int solution(vector<int> &A) {

// write your code in C++11

sort(A.begin(),A.end());

for(int i=1;i<=A.size();i++){

if(A[i-1]!=i)

return i;

}

return A.size()+1;

}

Sheng says:
March 19, 2015 at 12:04 am
Thanks for sharing!
UPDATE: it is NOT the right solution.
Reply
grois says:
April 6, 2015 at 3:03 pm
how did you get 100/100?
sort complexity is at least n*log(n)?
Reply
- Sheng says:
  April 6, 2015 at 10:22 pm
  The grading system is not so accurate to distinguish O(N) and O(NlogN). It is not the expected solution.
  Reply

#include <stdio.h>

#include <math.h>

int solution(int A[], int N) {

// write your code in C99

long total = 0;

for(int i = 0; i <= (N+1); i++)

total += i;

for(int i = 0; i < N; i++)

total -= A[i];

return total;

}

JavaScript 100%:

function solution(A) {

var length = A.length;

var sum = ((length + 1) /2) * (length + 2);

var sumMinusMissing = 0;

for (i = 0; i < length; i++) {

sumMinusMissing += A[i];

}

return sum - sumMinusMissing;

}

Tikki says:
June 27, 2015 at 9:25 am
I love your solution, but I’m trying to understand (I’m still learning) – how did you get the var sum line? What makes this calculation work?
var sum = ((length + 1) /2) * (length + 2);
Reply
- Sheng says:
  July 17, 2015 at 12:08 am
  Thanks! Please refer to the arithmetic sequence:
  https://en.wikipedia.org/wiki/Arithmetic_progression#Formulas_at_a_Glance
  Reply

Ashwani Kumar says:
May 16, 2015 at 9:11 am
As a programmer i never used XOR in my code. With your example i was able to get into the details of XOR, Thanks for that. But as i was trying to test your code i found that it seems to fail for following input:
Your test case: [5, 6, 4, 8]
Output (stderr):
WARNING: producing output may seriously slow down your code!
Output:
Missing number: 14
Returned value: 14
I used Java to run your code.
Reply
- Sheng says:
  May 16, 2015 at 11:40 pm
  Hello Ashwai, the problem says: “The array contains integers in the range [1..(N + 1)], which means that exactly one element is missing.”. Therefore your test case is invalid.
  Reply

Hi Sheng and the rest of you guys,
I’m trying to port your XOR code to Java, I wrote this

import java.util.Arrays;

public class Solution {

public int solutionJ(int[] A) {

int length = A.length;

int xor_sum=0;

for (int i:A) {

xor_sum = xor_sum ^ A[i] ^ (i + 1);

}

xor_sum = xor_sum^(length + 1);

return xor_sum;

}

and I get array a java.lang.ArrayIndexOutOfBoundsException…
Can you help? Thanks!

Sheng says:
May 24, 2015 at 12:02 am
Hello, it is a Java language problem.
for (int i:A)
the i is the element, not the index.
Reply

My solution is not O(N) but O(N*log(N)) but it is still 100% on codility.

import java.util.*;

class Solution {

public int solution(int[] A) {

Arrays.sort( A );

for(int i=0;i<A.length;i++){

if (A[i]!=i+1) return i+1;

}

return A.length+1;

}

Sebastian Cheung says:
July 3, 2015 at 3:35 am
1
2
3
[2, 3, 1, 5, 4] -> 6,
[2, 3, 1, 5, 4, 6] -> 7,
[2, 3, 1,6] -> 7

so especially the last case, should it not return either 4 or 5 instead of 7?
or it is designed for just a single missing number and not more than a single missing number? and if there is no missing number then it simply returns the next number in sequence?
Reply
- Sheng says:
  July 23, 2015 at 11:17 pm
  The third testing case is wrong. Please double read the question. A possible testing case is [2, 3, 1, 4] or [2, 3, 1, 5]. In your testing case, there are two missing integers as 4 and 5.
  Reply
  - D1code says:
    August 10, 2015 at 1:36 pm
    Could someone explain to me why the test is ignoring real world approach to problem solving?I think it should return 4 and 5 if they are both missing in a list! Now i see why it takes time solving this problems. The problem is solved but not what codility wants!
    Below return 4 but cant return more than 1 missing number
    1
    2
    3
    4
    5
    6
    7
    def solution(A):
        B = sorted(A)
        test = 0
        for index in range(0, len(B)):
            test +=1
            if test != B[index]:
                return test
    
    Reply
    - Sheng says:
      August 15, 2015 at 1:09 am
      You solution is wrong. Please try [1].
      I ignored the real-world approach, because this is the 0/1 world 🙂 The XOR solution is the most space- and time- efficient solution.
      Reply
Will says:
July 28, 2015 at 11:21 pm
Figured I’d throw out this solution. Performance is essentially identical to yours but it was a little easier to read, at least for me.
1
2
3
4
def solution_two(A):
    top_val = len(A) + 1
    val_dif = sum(A[:]) - sum(range(len(A)+1))
    return top_val - val_dif

Reply
- Sheng says:
  August 12, 2015 at 11:47 pm
  Thansk for sharing. However, addition/substration may not be a good idea, if we consider overflow.
  In addition:
  1. sum(A[:]) is euqal to sum(A)
  2. sum(range(len(A)+1)) could be optimized to sum(xrange(len(A)+1))
  3. your solution could be simplified to:
  1
  2
  def solution(A):
  return sum(xrange(len(A) + 2)) - sum(A)
  
  Thanks!
  Reply

This gives you all the missing numbers. Thoughts please!

def solution(A):
    #declare variables
    C=[]
    e=0
    #Assign Variables
    endindex = max(A) + 1
    #create my list that is correct pattern
    B = [x for x in range(1, endindex)]
    # sort the list
    A.sort()
    if len(A) == 0:
        return 0
    elif len(A) == 1 and A[0] == 1:
        return A[0]+1
    else:
        #iterate the two lists to find missing number.
        while e < len(B):
            #check of item is equal to item
            if A[e] != B[e]:
                #insert number if missing to adjust the index
                #for next iteration otherwise, result will have error
                A.insert(e, B[e])
                #collect missing number into a list
                C.append(B[e])
                print B[e]
                #return B[e]
            e+=1

int solution(int A[], int N) {

// write your code in C99

int i=0;

int k=0;

int j=0;

for(i=0;i<N;i++)

{

k+=i+1;

j+=A[i];

// printf("k=%d j=%dn", k, j);

}

k+=N+1;

return abs(k-j);

}

Sheng says:
October 16, 2015 at 10:59 pm
Will fail due to overflow.
Reply

class Solution {

public int solution(int[] A) {

// write your code in Java SE 8

Arrays.sort(A);

int missing = 0;

for (int i = 0; i < A.length; i++) {

if (A[i] != i + 1) {

missing = i + 1;

break;

}

missing = i + 2;

}

return missing;

}

public int solution(int[] A) {

// write your code in C# 6.0 with .NET 4.5 (Mono)

int Len = A.Length;

int TotalSum = (Len*(Len + 1))/2;

int ActualSum = 0;

for(int i =0; i <= Len -1; i++) {

ActualSum+= A[i];

}

if(ActualSum == TotalSum) {

return 1;

}

else {

return 0;

}

Sheng says:
October 17, 2015 at 12:23 am
Idea is right. But your return value is incorrect.
Reply

#include <cassert>

#include <vector>

#include <numeric>

using namespace std;

int solution(vector<int> &A) {

// write your code in C++11

size_t size=A.size();

assert(size>=0);

if(size==0)return 1;

//cout<<sizeof(long);

long N=static_cast<long>(size)+1L;

return N*(N+1L)/2L-accumulate(A.begin(),A.end(),0L);

}

My “solution” is definitely not as neat as yours. My question is: does “upward casting to long” have any flaw here? I am aware of two: 1. performance hit, it could be slower than bit operation; 2. it’s not portable since long could be 8 bytes on gcc, but 4 bytes on VS, although both of them are on 64-bit platform. Any other concern?
Thanks!

Sheng says:
November 21, 2015 at 11:04 pm
accumulate(A.begin(),A.end(),0L) may cause overflow (result is too big to fit in an integer).
upward casting usually is fine. But we cannot guarantee it is upward. It is possible that static_cast is a downward casting, when long is smaller than size_t.
Reply
- micropentium6 says:
  November 22, 2015 at 11:59 am
  Thank you for the timely reply!
  You may not notice that the third argument in accumulate is ‘0L’. According to C++ spec, accumulate, as a template function, takes the type on the third argument as the type for the returning value. So, I guess it’s just fine. If I miss anything, please let me know.
  Your concern on the cast between long and size_t is legit. size_t’s type is platform dependent for sure. if long is 4 bytes and size_t is 8 bytes (windows), the static_cast from size_t to long is downward. you may notice I checked the size of long in my source code, just in case. Well, codility probably not using Windows compiler.
  It makes me curious: is bitwise operation portable?
  Reply
  - Sheng says:
    November 22, 2015 at 12:53 pm
    No problem. You are welcome!
    0L means signed long, on some platform (32bits for long), the maximum positive value is 2147483647. While the “N is an integer within the range [0..100,000];”, in the worst case, accumulate return 5000050000, which is too large to fit in a signed long integer.
    Bitwise operation is portable. And it is the expected solution here.
    Reply

micropentium6 says:
November 22, 2015 at 3:47 pm
Thanks! That’s what I pointed out as well. 🙂 Well, I guess if the upward casting is a must have, in this case, long long might be a better choice since it’s guaranteed to be 8 bytes on all platforms I am aware of.
BTW, do you mind sharing how you learnt these bitwise tricks? I know “bitwise hacks”, but I will never be able to remember them or feel confident to apply them on any coding test.
Thanks!
Reply
- Sheng says:
  November 23, 2015 at 11:20 pm
  Long long works here. But bitwise solution is still preferred, because it is universal without overflow.
  In terms of bitwise operations, you have to use it many times to remember it. If you studied security or hardware, you have more chance to learn it.
  Reply
thepurple says:
December 16, 2015 at 5:52 am
A little modernization for the primary code.
1
2
3
4
5
6
def solution(A):
    length = len(A)
    xor_sum = length + 1
    for index in range(0, length):
        xor_sum = xor_sum ^ A[index] ^ (index + 1)
    return xor_sum

No last XOR required, you can just initialize xor_sum with: length + 1
Reply
- Sheng says:
  December 16, 2015 at 11:32 pm
  You’re totally right.
  Reply

my 100% C# solution

public static int solution(int[] A)

{

int length = A.Length;

double sum1 = 0;

double sum2 = 0;

for (int i = 1; i <= length + 1; i++)

sum1 += i;

for (int i = 0; i < length; i++)

sum2 += A[i];

return (int)(sum1 - sum2);

}

My Java solution (100%)

public int solution(int[] A) {

Arrays.sort(A);

for (int i = 0; i < A.length; i++) {

if (A[i] != i + 1)

return i + 1;

}

return A.length + 1;

}

This solution give me 100/100 in c#

using System;

using System.Collections.Generic;

using System.Linq;

// you can also use other imports, for example:

// using System.Collections.Generic;

// you can write to stdout for debugging purposes, e.g.

// Console.WriteLine("this is a debug message");

class Solution {

public int solution(int[] A) {

// write your code in C# 6.0 with .NET 4.5 (Mono)

IEnumerable N = Enumerable.Range(1, A.Length + 1);

int missingNumber = N.Except(A).FirstOrDefault();

return missingNumber;

}

My contribution in C#..

public int solution(int[] A)

{

if (A == null) return -1;

var sum = 0;

for (int index = A.Count(); index > 0; index--)

sum += A.ElementAt(index - 1) - index;

return Math.Abs(sum);

}

jokes says:
February 23, 2018 at 6:09 am
I see how XOR would work but seems completely unreadable and over-engineered to me compared to the below 1-liner in Python (100% score):
1
2
def solution(A):
return sum(range(1, len(A)+2)) - sum(A)

Reply

Hi Shang,
I’m loving your blog and learning much, thank you!
My arithmetic version equals or outperforms the XOR version in time performance for every test. I see that you specifically state “overflow” as being the catch on why XOR is better. Perhaps since in this test N is is limited to 100,000, the arithmetic version is better? And then maybe if N were allowed to go up to much higher limit (millions or billions), the XOR version would be safer to use? Thank you again!
Javascript
XOR version

function PermMissingElemXOR(A) {

var xor_sum = 0;

for(var index = 0, len = A.length; index < len; index++){

xor_sum = xor_sum ^ A[index] ^ (index + 1) ;

}

return xor_sum ^ (A.length + 1);

}

Arithmetic Version

function PermMissingElemArithmetic(A) {

const arrayLength = A.length;

if(arrayLength == 0) return 1;

const expectedSum = (arrayLength + 1) * (arrayLength + 2)/2;

return expectedSum - A.reduce((a,b)=>{return a + b});

}

Ah, I see now.. the repetitive calling of array length was slowing down the XOR version from my last comment. This one now outperforms the arithmetic version.
Javascript

function solution(A) {

const len = A.length;

var xor_sum = length + 1;

for(var index = 0; index < len; ++index){//index++ gives same result

xor_sum = xor_sum ^ A[index] ^ (index + 1) ;

}

return xor_sum;

}

Sheng says:
July 28, 2018 at 10:51 pm
Great to see that you got the solution by yourself 🙂 I would prefer the XOR solution, because:
1. Both solutions are O(N) time complexity. [A.reduce((a,b)=>{return a + b}) is O(N)].
2. XOR solution woks all the time, no matter what is the N’s limit.
Thanks!
Reply

filip5114 says:
December 12, 2018 at 12:37 pm
1
2
def solution(A):
return next((i+1 for i, v in enumerate(sorted(A)) if i+1 != v), len(A)+1)

This is my solution.
Is there any advanatge using your XOR solution or mine is fine as well?
Reply
- Sheng says:
  December 19, 2018 at 11:14 pm
  I do not fully understand your expression. But sorted is O(NlogN), while XOR solution is O(N).
  Reply

I’ve solved it 100% in Java, but with an additional HashSet (i.e. O(N) space and O(N) time complexity) as follow:

// you can also use imports, for example:

import java.util.*;

// you can write to stdout for debugging purposes, e.g.

// System.out.println("this is a debug message");

class Solution {

public int solution(int[] A) {

// write your code in Java SE 8

HashSet<Integer> valSet = new HashSet<Integer>();

for (int i=0;i<A.length; i++){

// If found, then remove (i.e. remove even occurrences)

if (valSet.contains(A[i])){

valSet.remove(A[i]);

}

else{

// Odd entry

valSet.add(A[i]);

}

Iterator valSetIter = valSet.iterator();

// Should have only one value based on valid input

if (valSetIter.hasNext()){

return (Integer)valSetIter.next();

}

return -1;

}

Can someone explains how XORing will resolve this?

Many thanks 🙂

Sunil says:
September 24, 2019 at 9:15 am
1
2
3
4
5
def solution(A):
    A = set(A)
    for i in range(1, len(A) + 2):
        if i not in A:
            return i

Reply

I did this on sets, also works.

def solution(A):

if len(A) == 0:

return 1

Amax = max(A)+1

B = set(range(1, Amax))

A = set(A)

ans = list(B-A)

if len(ans) == 0:

return Amax

else:

return ans[0]

JT says:
November 3, 2019 at 6:53 am
100%, Still think the XOR method better though.
1
2
3
4
def solution(A):
    num=set(range(1,len(A)+2))-set(A)
    for i in num:
        return(i)

Reply

Here is code for PHP that I porting from OP
100/100

function solution($A) {

// write your code in PHP7.0

$length = count($A);

$xor_sum = 0;

for ($i = 0; $i < $length; $i++) {

$xor_sum = $xor_sum ^ $A[$i] ^ ($i + 1);

}

return $xor_sum^($length + 1);

}

Sriharsha says:
March 23, 2020 at 11:12 pm
Hi Shen,
I am using c# and Linq, however this is not giving me 100% result. Can any one explain what am I doing wrong. My understanding is that there’s is only one element that is not paired and rest of the elements are always paired. Shouldn’t the following give me accurate results?
1
2
var fi= a.GroupBy(g =>g).ToList().OrderBy(l=>l.Count()).FirstOrDefault();
fi.Key

Appreciate your response on this
Reply
- Sheng says:
  March 28, 2020 at 10:35 pm
  Sorry, I do not understand C# and linq. Maybe the others can help.
  Reply

Thank you for your solution with xor! Nice touch about the extra information that xor never overflows. That way we can do the “summing” without worrying about using long or things like that.

BTW a solution with java and xor(scored 100%):

public int solution(int[] A) {

int xorSum = 0;

for (int i=0; i<A.length; i++) {

xorSum = xorSum ^ A[i] ^ (i+1);

}

return xorSum ^ (A.length + 1);

}

This is my solution, would you please comment on this?

def solution(numbers):

S = sum(numbers)

current_sum = 0

max_num = max(numbers)

min_num = min(numbers)

for i in range(min_num, max_num+1):

current_sum += i

return current_sum - S

Sheng says:
July 27, 2020 at 11:23 pm
Sorry, but it does not work for [1, 2, 3] or [2, 3, 4].
Reply

Renato Milan dos Santos says:
September 25, 2020 at 9:45 am
My 100% java solution:
1
2
3
4
5
6
7
8
public int solution(int[] A) {
  int r = 1;
  Arrays.sort(A);
  while (r <= A.length && r == A[r - 1]) {
    r++;
  }
  return r;
}

Reply

Here’s a C# solution (100%) using a hashset to record the numbers that have been found. Once a matching pair is found the number is removed so by the end we’re left with a single entry containing the unmatched number:

public int solution(int[] N)

{

var found = new HashSet<int>();

for (var i = 0; i < N.Length; i++)

{

var numberAtThisPosition = N[i];

// If this is the first time we've seen a number add it to the hashset.

if (!found.Contains(numberAtThisPosition))

{

found.Add(numberAtThisPosition);

}

else

{

// If we've found this number already remove it.

found.Remove(numberAtThisPosition);

}

return found.First();

}

lopz82 says:
February 7, 2021 at 10:18 am
My Python implementation:
1
2
3
from typing import List
def per_missing_elem(arr: List[int]) -> int:
return sum(range(1, len(arr) + 2)) - sum(arr)

Reply

Wow what an awesome place to get info. below is what i came up with (v slowly)
I have tried to get my small brain around XOR and it just hurts! so this is my simple way in c#. trying to learn so very happy to be torn apart. it got 100%

using System;

using System.Linq;

class Solution {

public int solution(int[] A) {

int len = A.Length;

if (len<1)

{

return 1;

}

var loop = Enumerable.Range(1, len+1).ToArray();

int sums = 0;

foreach (var item in loop)

{

sums += item;

}

int sums2 = 0;

foreach (var item in A)

{

sums2 += item;

}

return sums - sums2;

}

I tried to be smart:

int len = A.Length;

if (len<1)

{

return 1;

}

var loop = Enumerable.Range(1, len+1).ToArray();

int sums = loop.Sum();

int sums2 = A.Sum();

return sums - sums2;

but that only gave me 80% which suprised me.

Raul says:
April 18, 2022 at 1:59 pm
I made it like this:
1
2
3
4
5
6
7
def solution(A):
    if A == []:
        return 1
    A = sorted(A)
    s = ((len(A) + 1) * ((len(A) + 1) + 1)) // 2

    return s - sum(A)

Reply

123 Replies to “Solution to Perm-Missing-Elem by codility”

Leave a Reply to Sheng Cancel reply