Hi, could you summarize the process for making an external dataset (e.g. CSV, Excel, or SQL) available in Neuron so that it can be analyzed?
Kind regards
Hi Marco,
Before the Neuron machine learning engine can analyze an external dataset, the data has to go through what is called Data Preparation.
This consists mainly of the following:
I hope this helps,
Best Regards,
Amine
Hi Marco...
Not to knock the previous response to your question, but answers like this really drive me nuts!!! You ask a straightforward question and get an answer that better relates to theory and/or concept, as opposed to something that is actually immediately useful.
So, to answer your question:
You first need to create your dataset. This is the equivalent of naming your database. You then need to configure your dataset. This is basically the process of defining what your data will look like. Next you can actually load your data. While the Neuron plugin has some of these services, it's easier and faster to do these steps using something like Postman.
Here are some screenshots that might help. Note that any text between {{ }} should be replaced with your own values, unless you've configured a Postman environment that supplies them.
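If you end up scripting these requests instead of using Postman, the {{ }} substitution is easy to reproduce. This is only a minimal sketch: the function name, the placeholder names, and the example URL are all made up for illustration and are not the real Neuron URL layout.

```python
import re

# Matches Postman-style tokens such as {{host}} or {{ dataset }}.
PLACEHOLDER = re.compile(r"\{\{\s*([A-Za-z0-9_.-]+)\s*\}\}")


def fill_placeholders(template: str, values: dict) -> str:
    """Replace each {{name}} token with values[name]; fail loudly on a missing name."""

    def substitute(match):
        name = match.group(1)
        if name not in values:
            raise KeyError(f"no value supplied for placeholder {{{{{name}}}}}")
        return str(values[name])

    return PLACEHOLDER.sub(substitute, template)


# Hypothetical values and a made-up template; substitute your own.
example_values = {"host": "neuron.example.com", "dataset": "turbines"}
example_url = fill_placeholders("https://{{host}}/datasets/{{dataset}}", example_values)
```

Raising on a missing name is deliberate: a request that still contains a literal {{ }} token tends to fail in confusing ways further down the line.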
Create the dataset--Headers (the headers will almost always be the same for all the POST requests):
Create the dataset--body:
Configure the dataset--body:
The body is basically a big JSON object which contains all the metadata about your data. Amine's answer gives some insight into how to optimize your data, which in turn drives what your data configuration looks like.
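To get a first draft of that per-field metadata without typing it all by hand, you could guess a coarse type for each column from a few sample rows. Treat this strictly as a sketch: the key names "name" and "type", the type labels, and the sample values are my own placeholders, not the actual ColdLight configuration schema, so you would still need to map the result onto whatever your dataset config body expects.

```python
import json


def infer_type(values):
    """Guess a coarse type label for a column from its sample string values."""

    def all_parse(cast):
        try:
            for value in values:
                cast(value)
            return True
        except ValueError:
            return False

    if values and all_parse(int):
        return "integer"
    if values and all_parse(float):
        return "double"
    return "string"


def draft_field_metadata(header, sample_rows):
    """Build one {"name", "type"} entry per column (hypothetical key names)."""
    fields = []
    for index, name in enumerate(header):
        column = [row[index] for row in sample_rows]
        fields.append({"name": name, "type": infer_type(column)})
    return fields


# Made-up sample rows for three of the columns in a turbine-style CSV.
sample_header = ["identifier", "WindSpeed", "RPM"]
sample_rows = [["T-1", "12.5", "1500"], ["T-2", "9", "1480"]]
draft = draft_field_metadata(sample_header, sample_rows)
draft_json = json.dumps(draft, indent=2)
```

A guess like this only saves typing. Dates, categorical codes stored as numbers, and the fields you want to predict still need a human decision.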
Load Dataset Data--header:
Load Dataset Data--Body:
Here you would just choose your CSV file and submit it to ColdLight. Here is the header row of the CSV file that I'm using for my current project:
identifier,entry_dt,Latitude,Longitude,LastMaintained,MaintenanceOrg,MaintenanceTech,WindSpeed,Temperature,Humidity,AirContamination,SeismicReading,FrictionHarmonics,Voltage,Lubrication,GearboxModel,GearboxOEM,GearboxAge,BladeModel,BladeOEM,BladeAge,GeneratorModel,GeneratorOEM,GeneratorAge,GearboxFailure,BladeFailure,GeneratorFailure,AccelerationMagnitude,RPM
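Since the columns in the file need to line up with what you defined in the dataset configuration, a quick local check of the header row before uploading can save a failed load. A minimal sketch, assuming the expected column names are the ones in the header above; the function name is my own and nothing here talks to ColdLight.

```python
import csv
import io

# Expected columns, taken from the CSV header shown above.
EXPECTED_HEADER = (
    "identifier,entry_dt,Latitude,Longitude,LastMaintained,MaintenanceOrg,"
    "MaintenanceTech,WindSpeed,Temperature,Humidity,AirContamination,"
    "SeismicReading,FrictionHarmonics,Voltage,Lubrication,GearboxModel,"
    "GearboxOEM,GearboxAge,BladeModel,BladeOEM,BladeAge,GeneratorModel,"
    "GeneratorOEM,GeneratorAge,GearboxFailure,BladeFailure,GeneratorFailure,"
    "AccelerationMagnitude,RPM"
)
EXPECTED_COLUMNS = EXPECTED_HEADER.split(",")


def check_header(csv_stream, expected):
    """Return (missing, unexpected) column names for the first row of a CSV stream."""
    reader = csv.reader(csv_stream)
    header = [name.strip() for name in next(reader, [])]
    missing = [name for name in expected if name not in header]
    unexpected = [name for name in header if name not in expected]
    return missing, unexpected


# In-memory example; with a real file use open(path, newline="") instead.
missing, unexpected = check_header(io.StringIO(EXPECTED_HEADER + "\n"), EXPECTED_COLUMNS)
```

Both lists come back empty when the file matches. Anything in `missing` or `unexpected` is worth fixing before you submit the file.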
You can see from my dataset config body screenshot how some of these fields are defined.
Once you've got your data loaded you can now begin the process of submitting jobs for profiles, signals, and scoring.
Hopefully this will get you on the right path to successfully getting data loaded in ColdLight.