Questions tagged [unsupervised-learning]

Unsupervised learning refers to machine learning contexts in which there is no prior 'training' period in which the learning agent is trained on objects of known type. As such, supervised learning includes such disciplines as mathematical clustering, whereby data is segmented into clusters based on the minimisation or maximisation of mathematical properties and not on an attempt to classify by understanding the right context.

Unsupervised learning (or clustering) refers to machine learning algorithms in which there is no 'label' available for the training data and the model tries to learn the underlying manifold. As such, unsupervised learning includes such disciplines as mathematical clustering, whereby data is segmented into clusters based on the minimization or maximization of mathematical properties and not on an attempt to classify by understanding the right context.

547 questions

votes

1 answer

principal component analysis (PCA) in R: which function to use?

Can anyone explain what the major differences between the prcomp and princomp functions are? Is there any particular reason why I should choose one over the other? In case this is relevant, the type of application I am looking at is a quality…

r linear-algebra pca unsupervised-learning

asked Jan 10 '13 at 00:57

AndraD

2,672
6
34
48

votes

1 answer

Semi-supervised Naive Bayes with NLTK

I have built a semi-supervised version of NLTK's Naive Bayes in Python based on the EM (expectation-maximization algorithm). However, in some iterations of EM I am getting negative log-likelihoods (the log-likelihoods of EM must be positive in every…

python machine-learning nltk naivebayes unsupervised-learning

asked Oct 23 '12 at 13:55

SUP

votes

1 answer

Passing Target/Label data to Scikit-learn GridSearchCV's fit method for OneClassSVM

From my understanding, One-Class SVM's are trained without target/label data. One answer at Use of OneClassSVM with GridSearchCV suggests passing Target/Label data to GridSearchCV's fit method when the classifier is the OneClassSVM. How does the…

scikit-learn svm unsupervised-learning gridsearchcv one-class-classification

asked Oct 01 '19 at 01:40

user3731622

3,753
2
31
64

votes

1 answer

BERT performing worse than word2vec

I am trying to use BERT for a document ranking problem. My task is pretty straightforward. I have to do a similarity ranking for an input document. The only issue here is that I don’t have labels - so it’s more of a qualitative analysis. I am on my…

machine-learning deep-learning word2vec unsupervised-learning bert-language-model

asked Apr 21 '19 at 21:30

user3741951

votes

2 answers

Custom Hebbian Layer Implementation in Keras - input/output dims and lateral node connections

I'm trying to implement an unsupervised ANN using Hebbian updating in Keras. I found a custom Hebbian layer made by Dan Saunders here - https://github.com/djsaunde/rinns_python/blob/master/hebbian/hebbian.py (I hope it is not poor form to ask…

python tensorflow keras neural-network unsupervised-learning

asked Dec 28 '18 at 18:38

thposs

votes

2 answers

How to programmatically determine the column indices of principal components using FactoMineR package?

Given a data frame containing mixed variables (i.e. both categorical and continuous) like, digits = 0:9 # set seed for reproducibility set.seed(17) # function to create random string createRandString <- function(n = 5000) { a <- do.call(paste0,…

r cluster-analysis pca feature-selection unsupervised-learning

asked Jul 17 '18 at 10:54

mnm

1,695
2
15
39

votes

1 answer

How to prepare a dataset for speech recognition

I need to train a Bidirectional LSTM model to recognize discrete speech (individual numbers from 0 to 9) I have recorded speech from 100 speakers. What should I do next? (Suppose I am splitting them into individual .wav files containing one number…

speech-recognition recurrent-neural-network unsupervised-learning

asked Dec 26 '15 at 16:41

udani

1,131
1
9
27

votes

1 answer

scipy.optimize + kmeans clustering

I have the following setup for kmeans clustering algorithm that I am implementing for a project: import numpy as np import scipy import sys import random import matplotlib.pyplot as plt import operator class KMeansClass: #takes in an npArray…

python optimization scipy k-means unsupervised-learning

asked Nov 06 '13 at 20:18

anonuser0428

8,987
18
55
81

votes

8 answers

K- Means algorithm

I'm trying to program a k-means algorithm in Java. I have calculated a number of arrays, each of them containing a number of coefficients. I need to use a k-means algorithm in order to group all this data. Do you know any implementation of this…

java algorithm machine-learning grouping unsupervised-learning

asked Jun 28 '09 at 21:22

dedalo

2,441
12
30
34

votes

4 answers

Selecting an appropriate similarity metric & assessing the validity of a k-means clustering model

I have implemented k-means clustering for determining the clusters in 300 objects. Each of my object has about 30 dimensions. The distance is calculated using the Euclidean metric. I need to know How would I determine if my algorithms works…

machine-learning k-means cluster-analysis unsupervised-learning

asked Nov 12 '11 at 04:18

user350556

votes

0 answers

Unsupervised clustering of words in R without knowing k

As a beginner in NLP, I am trying to find the best way to cluster single words with unsupervised clustering, specifically where the number of clusters k is not known in advance. I have a group of words that contains clusters of words are very…

r string nlp cluster-analysis unsupervised-learning

asked Aug 17 '20 at 07:01

the_darkside

5,688
7
36
83

votes

1 answer

Implement CVAE for a single image

I have a multi-dimensional, hyper-spectral image (channels, width, height = 15, 2500, 2500). I want to compress its 15 channel dimensions into 5 channels.So, the output would be (channels, width, height = 5, 2500, 2500). One simple way to do is to…

tensorflow keras deep-learning unsupervised-learning dimensionality-reduction

asked Jun 27 '20 at 01:01

thunder

2,087
6
21
45

votes

0 answers

Why grpreg library and gglasso library in R are giving different results for group LASSO?

I have been trying to do unsupervised feature selection using LASSO (by removing class column). The dataset includes categorical (factor) and continuous (numeric) variables. Here is the link. I built a design matrix using model.matrix() which…

r feature-selection unsupervised-learning lasso-regression

asked Feb 27 '20 at 17:58

Mehmet Yildirim

votes

2 answers

Clustering images based on their similarity

I am facing a problem of image clustering based on their similarity, without knowing the number of clusters. Ideally i would like to achieve something that resembles this http://cs231n.github.io/assets/cnnvis/tsne.jpeg…

machine-learning image-processing computer-vision cluster-analysis unsupervised-learning

asked Oct 19 '19 at 11:01

Bartek Wójcik

votes

1 answer

How to build an unsupervised CNN model with keras/tensorflow?

I'm trying to build a CNN for an image-to-image translation application, the input of the model is an image, and the output is a confidence map. There are no labeled confidence as the ground truth during training, but a loss function is designed to…

python-3.x tensorflow keras unsupervised-learning

asked Apr 15 '19 at 01:41

Jemma

Prev 1 2

…

36 37 Next