Ethanol Plant Predictive Regression Models: The Importance of Plant Data Analytics

dc.contributor.authorSchneider, Cassandra
dc.date.accessioned2019-04-19T19:59:20Z
dc.date.available2019-04-19T19:59:20Z
dc.date.graduationmonthMayen_US
dc.date.published2019en_US
dc.description.abstractThe modern use of data analytics is not new to production processes, however the substantial reliance of it in the ethanol industry has been increasing over recent years. Being able to pull larger amounts of data is important to monitor an ethanol plant’s KPI’s (key performance indicators). Taking data analysis to the next level of being able to run regression models and predictive type examination to accurately determine ethanol yield is the succeeding phase currently facing the ethanol business on the plant level. Using simple regressions and an industry survey to deliver data can help both ethanol plant personnel and vendor data scientists to work together to be able to use all the information on hundreds of ethanol plant variables that are gathered daily to provide predictive guidance. Over the years, the ethanol industry has also become a crucial business for imports and exports in countries, such as the United States, Canada, and Brazil. However, the ethanol industry does come with many risks and challenges and many of them are beyond an ethanol plant’s control. For this reason, the purpose of this thesis is to examine the importance and impact of data analytics in ethanol production, and to determine the value that predictive modeling of ethanol yield can have for an ethanol plant. An ethanol industry-aimed survey was developed, conducted, and data summarized. Dependent variable and independent variables for regression analyses were analyzed to see the trends for 2010 from an Excel extract provided by Plant ABC. A linear regression model of ethanol plant data was used in this thesis to be able to examine contemporaneous dependence of fifteen different variables with the dependent variable, ethanol yield. Regression modeling was also used to determine the factors that are statistically significant in predicting ethanol yield using other types of models with alternative functional forms, including the semi-log, double-log, and quadratic. Ethanol yield linear regression was estimated and showed that the independent variables of ratio milo, Drop pH, Drop DP4+, Drop Glucose, Drop Lactic Acid, and Drop Acetic Acid had p-values under 1% and have significant correlations to ethanol yield. The quadratic model yielded the lowest RMSE indicating the best predicted model out of the four models estimated.en_US
dc.description.advisorJason S. Bergtolden_US
dc.description.degreeMaster of Agribusinessen_US
dc.description.departmentDepartment of Agricultural Economicsen_US
dc.description.levelMastersen_US
dc.identifier.urihttp://hdl.handle.net/2097/39665
dc.subjectEthanolen_US
dc.subjectData Analyticsen_US
dc.subjectRegressionen_US
dc.subjectPredictive Analysisen_US
dc.subjectProduction Processen_US
dc.titleEthanol Plant Predictive Regression Models: The Importance of Plant Data Analyticsen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
CassandraSchneider2019.pdf
Size:
881.08 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: