Questions tagged [machine-learning]

Implementation questions about machine learning algorithms. General questions about machine learning should be posted to their specific communities.

0
votes
1answer
10 views

COnfusion regarding working mechansim of activation function

For binary classification irrespective of the model used, the sigmoid function is a good choice for output layer because the actual output value ‘Y’ is either 0 or 1 so it makes sense for predicted ...
0
votes
1answer
11 views

What to do when the gain for two attributes overlap when trying to find the root?

Starting from a data set, I found out the gain for every attribute, but two of them overlap. How do I choose the right attribute for being the root of the tree?
0
votes
0answers
9 views

How can I handle mismatch between schema and model after transformation?

Exploring ML.Net and I want to predict employee turnover. I have a dataset available, with a mix between numeric and string values. This is all just purely exploration in my attempt in getting to ...
0
votes
1answer
13 views

PySpark transform dataframe

Let's say I have the following data in a dataframe receipts: Id | Fruits 1 | ['apple', 'banana'] 2 | ['apple'] 3 | ['pear'] 4 | ['pear', 'banana'] And I want to ...
0
votes
1answer
18 views

How to cluster products from different categories with no information about the product as such?

For a recommender system, with user cold start and (kind of) product cold start as well (I do not have any ratings but I am taking the # of users for each product as the product rating)- I have ...
-2
votes
0answers
6 views

How to make word cloud for each cluster in kmeans

"I trying to print data points in each cluster using word cloud and my data points is vectorizer data(BOW),How to print words in each cluster using word cloud..?" I already done optimal k for k-means ...
0
votes
1answer
39 views

how to make use of data that unknown in the future?

I have 2 datasets. df1 stores the data about the restaurant, df2 is weather data. df1 date how many customers came Sales($) how many pokes used (kg) ... 0 20180101 ...
0
votes
1answer
23 views

how to transform Time-series trend into a measurable predictor variable

I have a time series data which explains the number of frauds in the transaction over 1 year timeline along with the target variable of fraud or not. X- axis is time-line and Y- axis is number of ...
-1
votes
0answers
28 views

Problem regarding an argument while running the code

I have been trying to run this github repo https://github.com/InnerPeace-Wu/densecap-tensorflow and I am aiming to train the model myself instead of using a pretrained model. I have followed the ...
-1
votes
0answers
11 views

How to extract Text content that are aligned in different format in a scanned document using python?

I am working on text extraction from scanned document,where i want to extract the contents in section.For ex: Considering a resume having different sections like 'objective','education' etc.. Here i ...
0
votes
1answer
21 views

Can we use a model trained with image classification to help in object detection in tensorflow?

I have used Tensorflow-for-poets to build an image classification model. However, I now want to use the trained model in an object detection model. Can I just import the .pb files directly or do I ...
1
vote
1answer
32 views

How to extract specified segments from a scanned document using machine learning

I was using tensorflow object detection api to train images. And it was successful detecting the labeled objects. Now I want the same concept to be implemented in text extraction. Using deep learning ...
0
votes
1answer
13 views

Finding imagesets which are labelled for object detection?

I am building some object detectors with Tensorflow. Really enjoying it. The most time-intensive part of any project I work on is gathering images and drawing bounding boxes around the relevant ...
0
votes
1answer
11 views

Which prediction model should we use to predict the list of colleges for a student

I have a training data set containing College names,student rank, branch, college cutoff. Which prediction model should I use to predict the list of colleges a student will get admission in according ...
0
votes
0answers
15 views

Getting the Tensorflow graph executed when training or predicting

I am basically trying to find the computational graph that gets executed when I train the model and predict with the model separately. I tried using tf.get_default_graph() but I am not sure what I am ...
-1
votes
0answers
13 views

Is there any Python function or an similar approach for creating dynamic clusters?

I'm working on a clustering problem which is based on Geography. I want to create clusters based on distance and time. for e.g: I want to create a cluster whose intensity will be high at 2 and that ...
1
vote
0answers
23 views

Running BERT on CPU instead of GPU

I am trying to execute BERT's run_clasifier.py script using terminal as below: python run_classifier.py --task_name=cola --do_predict=true --data_dir=<data-dir> --vocab_file=$BERT_BASE_DIR/...
-4
votes
0answers
42 views

How to resolve or just skip 'NotImplementedError' and find another way?

I want to solve an equation by using sympy.solve().equation Have a look at picture pls. I want to implement the process by python I used python3.7.0 and sympy Tool: Jupyter Notebook It shows an ...
0
votes
0answers
17 views

The minimum norm solution of linear equation and inner product

this is my first time to post a question there. I want to know some details about L1-KPCA algorithm. https://reader.elsevier.com/reader/sd/pii/S0031320312002877?token=...
1
vote
0answers
28 views

What does “channel” mean in the context of deep learning?

I am a beginner in deep learning and while I was studying it, I came across the term "channel" several times, such as quantization channel, input channels, and output channels. Nevertheless, I am ...
-1
votes
1answer
10 views

Incorporating feedback to retrain WordToVec for finding document similarity

I have trained Gensim's WordToVec on a text corpus,converted it to DocToVec and then used cosine similarity to find the similarity between documents. I need to suggest similar documents. Now suppose ...
0
votes
0answers
7 views

Detailed Description when hovering over a point in poinplot Using Python

I have a point plot graph with weeks as x-axis and Scores as the y-axis. When hovering over a point on that graph I want a pop up where my observations will come off. Is it possible ? fig,ax1= plt....
-2
votes
0answers
35 views

Delving Deep into rectifiers:surpassing human-level performance on ImageNet classificatiom

I have read this paper (Delving Deep into rectifiers:surpassing human-level performance on ImageNet classificatiom)again and again and again ,but I do not understand initialization of filter weights ...
-1
votes
0answers
17 views

Copy detected face's names in a live stream (using Open CV) to a text file

I am working on a project in which I have to detect the faces and display the names on LCD using Raspberry Pi. I am new to Python(using it for the first time actually). I am practicing on my PC right ...
0
votes
1answer
104 views

Accuracy of multivariate classification and regression models with Scikit-Learn

I wrote one simple linear regression model and one decision tree model, they work good. My question is, how to calculate the accuracy of these two models. I mean, whats the difference between ...
0
votes
1answer
25 views

How to train keras models consecutively

I'm trying to train different models consecutively withou needing to re-run my program or change my code all the time, so this way I can let my PC training different models I use a for loop while ...
0
votes
1answer
32 views

understanding xgboost prediction from individual trees

First I run a very simple xgb regression model which contains only 2 trees with 1 leaf each. Data available here. (I understand this is a classification dataset but I just force the regression to ...
0
votes
0answers
11 views

Unable to save Naive Bayes trained model using pyspark.mllib although HADOOP_HOME is set correctly

I have trained a model in python using Naive Bayes, but I am not able to save the model in any form. I have been implemented my code on windows. Although saving and loading a model is documented here ...
0
votes
1answer
49 views

Negative accuracy score in regression models with Scikit-Learn

I wrote a code that predicts house prices. The problem is, Im getting negative accuracy score. I have used 5 different algorithms and accuracy score is all over the place. The first problem that I ...
-1
votes
1answer
35 views

NameError:name 'create_model' is not defined …i have tried importing model from keras but it hasnt solved it .how to solve?

I tried creating a model using tensorflow. When I tried executing it shows me the other files are in this link------- github.com/llSourcell/tensorflow_chatbot def train(): enc_train, ...
0
votes
0answers
13 views

How to create proper data set for logistic regression in Python (based on DFS)?

I have to implement logistic regression in Python to find the path to the goal based on my data sets. My question is how to create proper dataset and how to use it for machine learning. As you can ...
0
votes
1answer
21 views

DeepFool could not broadcast input array from shape (28,28,28) into shape (28,28,1)

I'm trying to do a deepfool attack with autoencoder but it gives me the error below: InvalidArgumentError Traceback (most recent call last) c:\users\MrUserMan\appdata\local\...
2
votes
1answer
36 views

Can someone explain these lines: X1, y1 = np.c_[np.random.normal(loc=new_center[0],

I want to create a dataset first thought Gaussian disrtibution (make_blobs) which gives me: 300 rows with 2 columns each X,y then having the maximum of X as a new center next I'm kinda lost I don't ...
0
votes
0answers
12 views

How to deal with a significant difference between the positive and negative predictive values (PPV and NPV)

In my data set there is about 3 times more negatives than positives. When I train my classifier and produce a confusion matrix I get: PPV ca. 0.35 NPV ca. 0.75 After I balance my training set so ...
0
votes
1answer
10 views

How to generate data based on an existing balanced dataset for binary classification in Python?

I have a dataset of 100K rows and 100 columns and i want to generate samples based on this existing dataset in order to make the output shape of dataset 10M rows and 100 columns? Any idea how to do ...
0
votes
1answer
13 views

Improve the loss reduction in a neural network model

The following code is to train a neural network model of a given dataset (50,000 samples, 64 dim). from keras import Sequential from keras.layers import Dense from keras.optimizers import Adam X, y =...
0
votes
2answers
20 views

Brain js prediction

I'm trying to create ML with Brain.js that takes as input a number and outputs its count of significant digits. Examples: Input: 234 Output:3 Input: 2413 Output: 4 Input: 1 Output 1 <...
-2
votes
1answer
26 views

ValueError: shapes (4155,1445) and (4587,7) not aligned: 1445 (dim 1) != 4587 (dim 0)

I'm trying to predict with a different dataset. But still have a problem with it I've tried to change the parameters, but still no difference. X_train, X_test, y_train, y_test = train_test_split(X, ...
-2
votes
0answers
13 views

The process of unsupervised Machine Learning [on hold]

my friends. I am doing now unsupervised Machine Learning. I have a kind of dataset. In dataset there are datas like weather, temperature humidity of one day etc. I got features of each data. The size ...
-1
votes
0answers
24 views

What are the machine learning methods that can be used to model survival (time-to-event) data? [on hold]

I'm currently trying to develop a predictive model to predict graft survival after kidney transplant using machine learning algorithms. I want to use time-to-event (survival) data and would like to ...
1
vote
0answers
41 views

Lineare Regression: dtype('<M8[ns]') to dtype('float64')

I receive the following TypeError: TypeError: Cannot cast array data from dtype('<M8[ns]') to dtype('float64') according to the rule 'safe' The reasons seems to be connected to that part here: X = ...
0
votes
1answer
27 views

Overfitting on the lightgbm

I am interested in solving machine learning problems, so I use the lightgbm to predict it . But the more iteration it trains, the bad effect it has.Sounding like overfitting.The truth is that the data ...
-3
votes
0answers
22 views

Derivation of Adversarially Learned Inference paper [on hold]

I am having a problem in understanding the generator update equation from this paper. I have attached the image with highlight.
0
votes
1answer
48 views

Handle missing values : When 99% of the data is missing from most columns (important ones)

I am facing a dilemma with a project of mine. Few of the variables don't have enough data that means almost 99% data observations are missing. I am thinking of couple of options - Impute missing ...
0
votes
1answer
21 views

How to use best estimator from pipeline to predict test set?

I developed a pipeline using XGBoost which returned me a best estimator. However, trying to use this best estimator to predict my test set the following error is raised: "ValueError: Specifying the ...
-1
votes
0answers
18 views

Implementing multiple models for the same context

I want to predict how much money will be extracted in a ATM for a given day. I have created the model and it works fine. Now, say that I have 300 ATMs, what would be a correct approach to handle this? ...
-1
votes
0answers
18 views

Reading and traversing a tree to collect values

I am currently working on a problem with a neural network. I have created a tree of neurons. In my neural network, each neuron is given a random vector as a weight. I have the tree as a whole (the ...
1
vote
1answer
31 views

Trying to create GAN: InvalidArgumentError: Matrix size-incompatible

I am quite new to the field but I am trying to create a Generative Adversarial Network to generate Music. I have a model that is a combination of the Generator and Discriminator but when I train it, ...
0
votes
0answers
17 views

Perceptron Class: The truth value of a Series is ambiguous

This is my code. I get an error when i try to run the 'fit' function. This is a breast cancer dataset. x and y are shown in code. I created x_train and y_train using Train-test split. Note that type(...
0
votes
0answers
15 views

How to set a fixed and proper Sequence Length in the Sentiment Analysis using LSTM?

I am working on a Sentiment Classification problem, and as many of you guys know that we have to do pre-processing of the text in order to feed it into word embedding layers. So, accordingly, in the ...