remove row with nan value

Question

let's say, for example, i have this data:

data <- c(1,2,3,4,5,6,NaN,5,9,NaN,23,9)
attr(data,"dim") <- c(6,2)
data

     [,1] [,2]
[1,]    1  NaN
[2,]    2    5
[3,]    3    9
[4,]    4  NaN
[5,]    5   23
[6,]    6    9

Now i want to remove the rows with the NaN values in it: row 1 and 4. But i don't know where these rows are, if it's a dataset of 100.000+ rows, so i need to find them with a function and remove the complete row.

Can anybody point me in the right direction?

also related https://stackoverflow.com/questions/4862178/remove-rows-with-all-or-some-nas-missing-values-in-data-frame — Álvaro A. Gutiérrez-Vargas, May 21 '21 at 12:04

score 47 · Accepted Answer · answered May 11 '11 at 09:26

47

The function complete.cases will tell you where the rows are that you need:

data <- matrix(c(1,2,3,4,5,6,NaN,5,9,NaN,23,9), ncol=2)
data[complete.cases(data), ]

     [,1] [,2]
[1,]    2    5
[2,]    3    9
[3,]    5   23
[4,]    6    9

answered May 11 '11 at 09:26

Andrie

163,419
39
422
472

score 7 · Answer 2 · answered May 11 '11 at 13:05

7

DATA

data <- matrix(rnorm(100000000), ncol=10000)
idx <- sample(1:10000,100)
idy <- sample(1:10000,100)
data[idx,idy] <- NaN

FUNCTION

foo <- function(d=data) d[apply(d,2,function(x) any(is.nan(x))),]

RUNTIMES

> system.time(data[complete.cases(data),])
   user  system elapsed 
   3.53    0.12    3.65 

> system.time(foo(data))
   user  system elapsed 
   2.37    0.09    2.46

answered May 11 '11 at 13:05

Wojciech Sobala

6,803
2
19
26

1

Shouldn't this be `d[!apply(d,1,function(x) any(is.nan(x))), ]` ? (using the "1" for the dimension instead of "2", and having the "!" to get only the lines which do not have NaN? ) – MeloMCR Sep 08 '15 at 17:24

remove row with nan value

2 Answers2

Linked

Related