How to replace missing value with neighboring column value in R

Asked Mar 24 '17 at 18:23

I have a dataset with several missing values in X2015. My idea is to replace each missing X2015 value with the value from X2014 in the same row, when X2014 has one. If X2014 is also missing, the X2015 value should simply stay missing.

Below is my code, but it doesn't work. There are hundreds of rows, and I don't want to replace them one by one.

y <- read.csv("literacy_rate.csv", stringsAsFactors = FALSE)
y1 <- y$X2015
y2 <- y$X2014

for (i in y1) {
  if (is.na(y1) == TRUE){
    y1[i] <- y2[i]
  }
}
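
A note on why the loop above likely fails, with a corrected sketch. `for (i in y1)` iterates over the values of `y1` rather than over row positions, `is.na(y1)` returns a whole vector where `if` expects a single TRUE or FALSE (a warning in older R versions, an error from R 4.2.0 on), and the assignment only changes the copy `y1`, not `y$X2015`. The data frame below is a hypothetical stand-in built from the sample rows in the question, not the real `literacy_rate.csv`.

```r
# Hypothetical stand-in for the CSV, using the sample rows from the question.
y <- data.frame(
  X2014 = c(NA, NA, NA, 70.9, NA, NA, NA, 98.0, NA, NA, 98.94),
  X2015 = c(97.524, NA, 38.168, 71, 97, NA, 92.9, 98.98, 99.76, NA, NA)
)

# Loop over row positions and test one element at a time.
for (i in seq_along(y$X2015)) {
  if (is.na(y$X2015[i])) {
    # Write back into the data frame; the result stays NA if X2014 is NA too.
    y$X2015[i] <- y$X2014[i]
  }
}
```

This keeps the loop structure of the original attempt; a vectorized assignment is usually shorter in R.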

Here is a sample of my data:

X2014 X2015
na    97.524
na    na
na    38.168
70.9  71
na    97
na    na
na    92.9
98.0  98.98
na    99.76
na    na
98.94 na
.     .
.     .
.     .

My expected result is:

X2014 X2015
na    97.524
na    na
na    38.168
70.9  71
na    97
na    na
na    92.9
98.0  98.98
na    99.76
na    na
98.94 98.94
.     .
.     .
.     .

asked Mar 24 '17 at 18:23

Patrick D

Just use subsetting. Something like this `df$b[is.na(df$b)] – lmo Mar 24 '17 at 18:29
@lmo Thank you so much!! It works pretty well! – Patrick D Mar 24 '17 at 18:38
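
The snippet in the comment is cut off, so the following is only one plausible reading of "use subsetting", applied to the column names from the question. It assumes the missing entries are real `NA` values; the data frame is a hypothetical stand-in built from the sample rows.

```r
# Hypothetical stand-in for the CSV, using the sample rows from the question.
# Assumption: if the file literally contains the text "na", reading it with
# read.csv(..., na.strings = c("NA", "na")) would turn those cells into NA.
y <- data.frame(
  X2014 = c(NA, NA, NA, 70.9, NA, NA, NA, 98.0, NA, NA, 98.94),
  X2015 = c(97.524, NA, 38.168, 71, 97, NA, 92.9, 98.98, 99.76, NA, NA)
)

# Logical index of the rows where X2015 is missing.
missing_2015 <- is.na(y$X2015)

# Fill those rows from X2014 in one vectorized assignment.
# Rows where X2014 is also NA simply remain NA.
y$X2015[missing_2015] <- y$X2014[missing_2015]
```

No loop is needed because the logical index selects the same rows on both sides of the assignment.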
