2016 INFORMS Annual Meeting Program

INFORMS Nashville – 2016

123

MA06

102A-MCC

Social Media Analytics

Sponsored Session

Chair: Zhe Shan, University of Cincinnati, Cincinnati, OH,

United States,

zhe.shan@uc.edu

1 - Tree-based Models For Longitudinal Data

Peng Wang, University of Cincinnati,

wangp9@ucmail.uc.edu

Dan Liu, Brittany Green

Classification and regression tree (CART) has been broadly applied due to its

simplicity of explanation, automatic variable selection, visualization and

interpretation. Previous algorithms for constructing CART for longitudinal data

suffer from the computational difficulties in estimation of covariance matrix at

each node. We proposed to utilize the quadratic inference function (QIF) and

developed a new criterion, named RSSQ, to select the best splits. The proposed

approach incorporates correlation wihout estimating the correlation parameters.

Therefore we could improve the efficiency of the partition results and prediction

accuracy. This is joint work with Dan Liu and Brittany Green

2 - Mining Process Patterns Via Electronic Medical Record

Audit Logs

He Zhang, University of South Florida,

hezhang@usf.edu

Mining the process patterns in the access logs from information systems can

provide useful insights for the workow patterns. One important issue in process

mining is that the workow is usually highly dynamic and the access logs are

noisy. We presenta framework to analyze process models with noisy data at an

abstract level. We implement our approach using several months of data from a

large academic medical center. Empirical results show that our framework can

extract the process models effectively.

3 - Feature Selection For Quality Assessment Of Predicted

Protein Structures

Shokoufeh Mirzaei, California State Polytechnic University -

Pomona, CA,

smirzaei@cpp.edu

In the context of computational biology a scoring function determines the quality

of a predicted protein structure. The Goal of this paper is to find a subset of

protein features that are critical in identifying native protein structures in order to

develop a scoring function. In pursuit of this goal, our method of research consists

of 1) identify a set of protein features suggested by the literature, 2) use a variety

of feature selection methods to select the best subset of features 3) use deep

learning techniques in machine learning to identify new features. The final

Outcome of this research is a subset of protein features and a new scoring

function which predicts the quality of protein models.

4 - Healthcare Fraud Analysis Using Sequential Data Mining

Babak Zafari, Babson College, Boston, MA, United States,

zafari.babak@gmail.com

It is estimated that at least three percent of annual health care spending is lost to

overpayments. However, the large size and complexity of the health care system

make comprehensive auditing infeasible. This resulted in the use of data mining

approaches to detect unusual payments. In this talk, we propose the use of

pattern discovery methods to find the anamolies in the medical claims payment

data.

MA08

103A-MCC

Crowd-based Innovation

Invited: Business Model Innovation

Invited Session

Chair: Bilal Gokpinar, UCL, London, United Kingdom,

b.gokpinar@ucl.ac.uk

1 - Experience Breadth And Problem-solving In Crowdsourcing

Contests: An Empirical Investigation

Anant Mishra, George Mason University, Fairfax, VA, 22030,

United States,

amishra6@gmu.edu,

Nirup M Menon, Shun Ye

Online crowdsourcing contests have become a popular mechanism for addressing

challenging problems. In this study, we use a multi-dimensional classification

scheme to represent a contestant’s breadth of experience on a crowdsourcing

platform, and examine how it impacts her performance in a contest. Using

detailed archival data from TopCoder, a crowdsourcing platform that hosts

contests across software development problem domains (e.g., architecture, design,

testing), our results demonstrate that a contestant’s breadth of experience has a

nuanced relationship with her performance in a contest.

2 - Spatial Distribution Of Alternative Finance

Mingfeng Lin, University of Arizona, 1130 E. Helen St, Tucson, AZ,

85721, United States,

mingfeng@eller.arizona.edu

, Bryan Zhang

Will online alternative finance help reduce the geographical imbalance of capital

that the literature has long documented of the traditional finance? Are funding

distributions geographically more equitable online, and are there differences

across different types of crowdfunding forms? We answer these questions by

leveraging detailed transactions data from multiple major crowdfunding platforms

in the UK. We compare the spatial flow in online debt crowdfunding to bank

lending, and online equity crowdfunding to venture capital and private equity

financing. We also compare the funding flow between different types of

crowdfunding. Results show some surprising and interesting patterns.

3 - The Role Of Customer Investor Involvement In

Crowdfunding Success

Philipp Cornelius, University College London, London, United

Kingdom,

philipp.cornelius.12@ucl.ac.uk

, Bilal Gokpinar

Entrepreneurs and organisations increasingly use crowdfunding to fund

innovation projects through a large number of customer investments. The

growing literature on the topic has predominantly studied crowdfunding in terms

of its financing mechanism. The involvement of customers during crowdfunding,

however, goes beyond the provision of capital. As investors and prospective

product customers, crowdfunders want to influence crowdfunding campaigns.

Can project creators benefit from this or does an increased influence of the crowd

push products into too many directions and deter other customers from funding?

MA08