
Changed to make it more general #3

Status: Open (7 commits to merge into base: master)


@rsesha rsesha commented Apr 2, 2017

I have created a slightly different version of your program with the
following changes:

1. It now takes a single "Training File" as input and automatically
   splits it into the "tr_data", "cv_data" and "gt_data" files. This
   should be easier for people who don't know the difference between
   the three files unless they have watched Andrew Ng's video and know
   what each one means (and does).
2. I have added a function called "select_num_cols" that automatically
   selects the numeric columns from that data set. This lets data
   scientists work with a smaller feature set than the full file
   contains, and it works well with your Gaussian distribution program.
3. Since this version works with more than 2 variables, I have left out
   the plots of the variables.

I hope these changes are acceptable. If not, you can create your own
version.
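
For anyone unfamiliar with the three files, here is a minimal sketch of the two ideas described above. It is not the code in this pull request. It assumes pandas, and it assumes that "gt_data" holds the ground-truth labels for the cross-validation rows. The helper name `split_training_file`, the label column `is_anomaly`, and the 30% cross-validation share are all hypothetical choices made for illustration.

```python
import numpy as np
import pandas as pd


def select_num_cols(df):
    """Keep only the numeric columns (an assumed reading of the PR's helper)."""
    return df.select_dtypes(include=[np.number])


def split_training_file(df, label_col, cv_fraction=0.3, seed=0):
    """Split one labelled table into tr_data, cv_data and gt_data.

    Assumption: gt_data is the label column for the cross-validation rows,
    and tr_data / cv_data contain features only.
    """
    # Shuffle the rows reproducibly before cutting the table in two.
    shuffled = df.sample(frac=1.0, random_state=seed).reset_index(drop=True)
    n_cv = int(round(len(shuffled) * cv_fraction))
    cv_part = shuffled.iloc[:n_cv]
    tr_part = shuffled.iloc[n_cv:]

    feature_cols = [c for c in df.columns if c != label_col]
    tr_data = tr_part[feature_cols].reset_index(drop=True)
    cv_data = cv_part[feature_cols].reset_index(drop=True)
    gt_data = cv_part[label_col].reset_index(drop=True)
    return tr_data, cv_data, gt_data


# Toy input: two numeric features, one text column, one 0/1 label.
raw = pd.DataFrame({
    "latency": [10.1, 9.8, 10.3, 25.0, 10.0, 9.9, 10.2, 30.5, 10.4, 9.7],
    "throughput": [50.0, 51.2, 49.5, 12.0, 50.3, 50.9, 49.8, 8.1, 50.6, 51.0],
    "host": ["a", "b", "c", "d", "e", "f", "g", "h", "i", "j"],
    "is_anomaly": [0, 0, 0, 1, 0, 0, 0, 1, 0, 0],
})

numeric = select_num_cols(raw)  # drops the text column "host"
tr_data, cv_data, gt_data = split_training_file(numeric, "is_anomaly")
```

Selecting numeric columns first means the split only ever sees columns a Gaussian model can use, so the label column must itself be numeric (0/1 here) to survive that step.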

Ram Seshadri and others added 7 commits April 1, 2017 22:19