# Questions tagged [machine-learning]

Implementation questions about machine learning algorithms. General questions about machine learning should be posted to their specific communities.

31,280 questions

**0**

votes

**1**answer

10 views

### COnfusion regarding working mechansim of activation function

For binary classification irrespective of the model used, the sigmoid function is a good choice for output layer because the actual output value ‘Y’ is either 0 or 1 so it makes sense for predicted ...

**0**

votes

**1**answer

11 views

### What to do when the gain for two attributes overlap when trying to find the root?

Starting from a data set, I found out the gain for every attribute, but two of them overlap. How do I choose the right attribute for being the root of the tree?

**0**

votes

**0**answers

9 views

### How can I handle mismatch between schema and model after transformation?

Exploring ML.Net and I want to predict employee turnover. I have a dataset available, with a mix between numeric and string values.
This is all just purely exploration in my attempt in getting to ...

**0**

votes

**1**answer

13 views

### PySpark transform dataframe

Let's say I have the following data in a dataframe receipts:
Id | Fruits
1 | ['apple', 'banana']
2 | ['apple']
3 | ['pear']
4 | ['pear', 'banana']
And I want to ...

**0**

votes

**1**answer

18 views

### How to cluster products from different categories with no information about the product as such?

For a recommender system, with user cold start and (kind of) product cold start as well (I do not have any ratings but I am taking the # of users for each product as the product rating)- I have ...

**-2**

votes

**0**answers

6 views

### How to make word cloud for each cluster in kmeans

"I trying to print data points in each cluster using word cloud and my data points is vectorizer data(BOW),How to print words in each cluster using word cloud..?"
I already done optimal k for k-means ...

**0**

votes

**1**answer

39 views

### how to make use of data that unknown in the future?

I have 2 datasets. df1 stores the data about the restaurant, df2 is weather data.
df1
date how many customers came Sales($) how many pokes used (kg) ...
0 20180101 ...

**0**

votes

**1**answer

23 views

### how to transform Time-series trend into a measurable predictor variable

I have a time series data which explains the number of frauds in the transaction over 1 year timeline along with the target variable of fraud or not.
X- axis is time-line and Y- axis is number of ...

**-1**

votes

**0**answers

28 views

### Problem regarding an argument while running the code

I have been trying to run this github repo https://github.com/InnerPeace-Wu/densecap-tensorflow and I am aiming to train the model myself instead of using a pretrained model. I have followed the ...

**-1**

votes

**0**answers

11 views

### How to extract Text content that are aligned in different format in a scanned document using python?

I am working on text extraction from scanned document,where i want to extract the contents in section.For ex: Considering a resume having different sections like 'objective','education' etc.. Here i ...

**0**

votes

**1**answer

21 views

### Can we use a model trained with image classification to help in object detection in tensorflow?

I have used Tensorflow-for-poets to build an image classification model. However, I now want to use the trained model in an object detection model. Can I just import the .pb files directly or do I ...

**1**

vote

**1**answer

32 views

### How to extract specified segments from a scanned document using machine learning

I was using tensorflow object detection api to train images. And it was successful detecting the labeled objects.
Now I want the same concept to be implemented in text extraction. Using deep learning ...

**0**

votes

**1**answer

13 views

### Finding imagesets which are labelled for object detection?

I am building some object detectors with Tensorflow. Really enjoying it.
The most time-intensive part of any project I work on is gathering images and drawing bounding boxes around the relevant ...

**0**

votes

**1**answer

11 views

### Which prediction model should we use to predict the list of colleges for a student

I have a training data set containing College names,student rank, branch, college cutoff. Which prediction model should I use to predict the list of colleges a student will get admission in according ...

**0**

votes

**0**answers

15 views

### Getting the Tensorflow graph executed when training or predicting

I am basically trying to find the computational graph that gets executed when I train the model and predict with the model separately.
I tried using tf.get_default_graph() but I am not sure what I am ...

**-1**

votes

**0**answers

13 views

### Is there any Python function or an similar approach for creating dynamic clusters?

I'm working on a clustering problem which is based on Geography. I want to create clusters based on distance and time.
for e.g:
I want to create a cluster whose intensity will be high at 2 and that ...

**1**

vote

**0**answers

23 views

### Running BERT on CPU instead of GPU

I am trying to execute BERT's run_clasifier.py script using terminal as below:
python run_classifier.py --task_name=cola --do_predict=true --data_dir=<data-dir> --vocab_file=$BERT_BASE_DIR/...

**-4**

votes

**0**answers

42 views

### How to resolve or just skip 'NotImplementedError' and find another way?

I want to solve an equation by using sympy.solve().equation
Have a look at picture pls. I want to implement the process by python
I used python3.7.0 and sympy
Tool: Jupyter Notebook
It shows an ...

**0**

votes

**0**answers

17 views

### The minimum norm solution of linear equation and inner product

this is my first time to post a question there. I want to know some details about L1-KPCA algorithm.
https://reader.elsevier.com/reader/sd/pii/S0031320312002877?token=...

**1**

vote

**0**answers

28 views

### What does “channel” mean in the context of deep learning?

I am a beginner in deep learning and while I was studying it, I came across the term "channel" several times, such as quantization channel, input channels, and output channels. Nevertheless, I am ...

**-1**

votes

**1**answer

10 views

### Incorporating feedback to retrain WordToVec for finding document similarity

I have trained Gensim's WordToVec on a text corpus,converted it to DocToVec and then used cosine similarity to find the similarity between documents. I need to suggest similar documents. Now suppose ...

**0**

votes

**0**answers

7 views

### Detailed Description when hovering over a point in poinplot Using Python

I have a point plot graph with weeks as x-axis and Scores as the y-axis. When hovering over a point on that graph I want a pop up where my observations will come off. Is it possible ?
fig,ax1= plt....

**-2**

votes

**0**answers

35 views

### Delving Deep into rectifiers:surpassing human-level performance on ImageNet classificatiom

I have read this paper (Delving Deep into rectifiers:surpassing human-level performance on ImageNet classificatiom)again and again and again ,but I do not understand initialization of filter weights ...

**-1**

votes

**0**answers

17 views

### Copy detected face's names in a live stream (using Open CV) to a text file

I am working on a project in which I have to detect the faces and display the names on LCD using Raspberry Pi. I am new to Python(using it for the first time actually). I am practicing on my PC right ...

**0**

votes

**1**answer

104 views

### Accuracy of multivariate classification and regression models with Scikit-Learn

I wrote one simple linear regression model and one decision tree model, they work good.
My question is, how to calculate the accuracy of these two models. I mean, whats the difference between ...

**0**

votes

**1**answer

25 views

### How to train keras models consecutively

I'm trying to train different models consecutively withou needing to re-run my program or change my code all the time, so this way I can let my PC training different models
I use a for loop while ...

**0**

votes

**1**answer

32 views

### understanding xgboost prediction from individual trees

First I run a very simple xgb regression model which contains only 2 trees with 1 leaf each. Data available here. (I understand this is a classification dataset but I just force the regression to ...

**0**

votes

**0**answers

11 views

### Unable to save Naive Bayes trained model using pyspark.mllib although HADOOP_HOME is set correctly

I have trained a model in python using Naive Bayes, but I am not able to save the model in any form. I have been implemented my code on windows. Although saving and loading a model is documented here ...

**0**

votes

**1**answer

49 views

### Negative accuracy score in regression models with Scikit-Learn

I wrote a code that predicts house prices. The problem is, Im getting negative accuracy score.
I have used 5 different algorithms and accuracy score is all over the place.
The first problem that I ...

**-1**

votes

**1**answer

35 views

### NameError:name 'create_model' is not defined …i have tried importing model from keras but it hasnt solved it .how to solve?

I tried creating a model using tensorflow. When I tried executing it shows me
the other files are in this link------- github.com/llSourcell/tensorflow_chatbot
def train():
enc_train, ...

**0**

votes

**0**answers

13 views

### How to create proper data set for logistic regression in Python (based on DFS)?

I have to implement logistic regression in Python to find the path to the goal based on my data sets. My question is how to create proper dataset and how to use it for machine learning.
As you can ...

**0**

votes

**1**answer

21 views

### DeepFool could not broadcast input array from shape (28,28,28) into shape (28,28,1)

I'm trying to do a deepfool attack with autoencoder but it gives me the error below:
InvalidArgumentError Traceback (most recent call
last)
c:\users\MrUserMan\appdata\local\...

**2**

votes

**1**answer

36 views

### Can someone explain these lines: X1, y1 = np.c_[np.random.normal(loc=new_center[0],

I want to create a dataset first thought Gaussian disrtibution (make_blobs) which gives me: 300 rows with 2 columns each X,y then having the maximum of X as a new center next I'm kinda lost I don't ...

**0**

votes

**0**answers

12 views

### How to deal with a significant difference between the positive and negative predictive values (PPV and NPV)

In my data set there is about 3 times more negatives than positives. When I train my classifier and produce a confusion matrix I get:
PPV ca. 0.35
NPV ca. 0.75
After I balance my training set so ...

**0**

votes

**1**answer

10 views

### How to generate data based on an existing balanced dataset for binary classification in Python?

I have a dataset of 100K rows and 100 columns and i want to generate samples based on this existing dataset in order to make the output shape of dataset 10M rows and 100 columns?
Any idea how to do ...

**0**

votes

**1**answer

13 views

### Improve the loss reduction in a neural network model

The following code is to train a neural network model of a given dataset (50,000 samples, 64 dim).
from keras import Sequential
from keras.layers import Dense
from keras.optimizers import Adam
X, y =...

**0**

votes

**2**answers

20 views

### Brain js prediction

I'm trying to create ML with Brain.js that takes as input a number and outputs its count of significant digits.
Examples:
Input: 234 Output:3
Input: 2413 Output: 4
Input: 1 Output 1
<...

**-2**

votes

**1**answer

26 views

### ValueError: shapes (4155,1445) and (4587,7) not aligned: 1445 (dim 1) != 4587 (dim 0)

I'm trying to predict with a different dataset. But still have a problem with it
I've tried to change the parameters, but still no difference.
X_train, X_test, y_train, y_test = train_test_split(X, ...

**-2**

votes

**0**answers

13 views

### The process of unsupervised Machine Learning [on hold]

my friends. I am doing now unsupervised Machine Learning. I have a kind of dataset. In dataset there are datas like weather, temperature humidity of one day etc. I got features of each data. The size ...

**-1**

votes

**0**answers

24 views

### What are the machine learning methods that can be used to model survival (time-to-event) data? [on hold]

I'm currently trying to develop a predictive model to predict graft survival after kidney transplant using machine learning algorithms. I want to use time-to-event (survival) data and would like to ...

**1**

vote

**0**answers

41 views

### Lineare Regression: dtype('<M8[ns]') to dtype('float64')

I receive the following TypeError:
TypeError: Cannot cast array data from dtype('<M8[ns]') to dtype('float64') according to the rule 'safe'
The reasons seems to be connected to that part here:
X = ...

**0**

votes

**1**answer

27 views

### Overfitting on the lightgbm

I am interested in solving machine learning problems, so I use the lightgbm to predict it . But the more iteration it trains, the bad effect it has.Sounding like overfitting.The truth is that the data ...

**-3**

votes

**0**answers

22 views

### Derivation of Adversarially Learned Inference paper [on hold]

I am having a problem in understanding the generator update equation from this paper. I have attached the image with highlight.

**0**

votes

**1**answer

48 views

### Handle missing values : When 99% of the data is missing from most columns (important ones)

I am facing a dilemma with a project of mine. Few of the variables don't have enough data that means almost 99% data observations are missing.
I am thinking of couple of options -
Impute missing ...

**0**

votes

**1**answer

21 views

### How to use best estimator from pipeline to predict test set?

I developed a pipeline using XGBoost which returned me a best estimator.
However, trying to use this best estimator to predict my test set the following error is raised: "ValueError: Specifying the ...

**-1**

votes

**0**answers

18 views

### Implementing multiple models for the same context

I want to predict how much money will be extracted in a ATM for a given day. I have created the model and it works fine. Now, say that I have 300 ATMs, what would be a correct approach to handle this? ...

**-1**

votes

**0**answers

18 views

### Reading and traversing a tree to collect values

I am currently working on a problem with a neural network. I have created a tree of neurons. In my neural network, each neuron is given a random vector as a weight. I have the tree as a whole (the ...

**1**

vote

**1**answer

31 views

### Trying to create GAN: InvalidArgumentError: Matrix size-incompatible

I am quite new to the field but I am trying to create a Generative Adversarial Network to generate Music. I have a model that is a combination of the Generator and Discriminator but when I train it, ...

**0**

votes

**0**answers

17 views

### Perceptron Class: The truth value of a Series is ambiguous

This is my code. I get an error when i try to run the 'fit' function. This is a breast cancer dataset. x and y are shown in code. I created x_train and y_train using Train-test split. Note that type(...

**0**

votes

**0**answers

15 views

### How to set a fixed and proper Sequence Length in the Sentiment Analysis using LSTM?

I am working on a Sentiment Classification problem, and as many of you guys know that we have to do pre-processing of the text in order to feed it into word embedding layers. So, accordingly, in the ...