How do the North American amusement parks compare in popularity? If this question was to come up during a lunch discussion, I bet someone would pull out their smartphone and go to Wikipedia for the answer. But is Wikipedia the definitive answer - how can we tell if Wikipedia is wrong?
Uncategorized
Pearson's correlation measures the linear association between two variables. Because the correlation is bounded between [-1, 1], the sampling distribution for highly correlated variables is highly skewed. Even for bivariate normal data, the skewness makes it challenging to estimate confidence intervals for the correlation, to run one-sample hypothesis tests ("Is
Healthcare, like many industries, is in the midst of a paradigm shift, says Chris Donovan, Executive Director of Enterprise Information Management & Analytics for the Cleveland Clinic. "Historically, healthcare was really about intervention, and about taking care of you when you were sick and getting you better." That type of care
12 hours: That’s how quickly you can die from sepsis. Oh – you’ve never heard of sepsis? Not surprising. More Americans have heard of Ebola, a nearly non-existent condition in the U.S., than sepsis – a condition that affects more than 1.6 million Americans every year. Sepsis is the body’s
Get faster value out of your data by empowering business users to work with data on their own.
現地時間 2017/9/18,19,20 にてSASの秋のグローバルイベントである、「Analytics Experience 2017 (以下AX2017)」がアメリカ合衆国ワシントンDCで開催中です。今回は、日本から参加している筑波大学理工学群社会工学類経営工学主専攻4年生の村井諒さん,小林大悟さん,白鳥友風さん3名による参加レポート1日目を掲載します。 Academic Summit@AX2017 レポート by 筑波大学学生 今回私たち3人が参加しているAX2017の1日目は、AM11:00にスタートしたGeneral Sessionをはじめ、様々な講演が行われました。 中でも最後時間帯である19:00から催されたAcademic Summitについてご紹介させていただきます。 Academic Summitは、AX2017に出席しているデータサイエンスに精通する学生が、学生間や企業の方々との交流を深めるイベントです。このサミットでは、SAS Executive Vice President およびSAS Chief Technology OfficerであるDr.Oliver Schabenberger氏の基調講演や、Gather IQという、クラウドソーシングによってあるトピックに関する問題の解決を図るアプリの説明、女性の技術職としてのキャリアを支援する制度、学生によるアナリティクスのコンテストであるShootout Competition における入賞チーム3組についての紹介がされ、最後に自由な交流の時間が設けられました。 Schabenberger氏は純粋数学を学んだのち、データサイエンスの道へと進むことになった経緯や、現在SAS社が注目しているAmbient AnalyticsとDeep Learningについての説明、さらに自分自身を成長させるための教訓などをお話ししてくださいました。 またGather IQは、SAS社のミッションの一つである社会貢献のためのアナリティクスの価値を非営利で提供するということを体現していたと感じました。 このイベントの最後には自由にコミュニケーションをとる時間が設けられ、参加者の皆様は積極的に情報交換を行っていました。何より印象に残ったのは、同年代で飛び級で大学院に進学した人や、SASR Enterprise Minerを使いこなしモデリングを行っていた人がいたこと、さらに、参加者全員が英語で円滑にコミュニケーションを行っていたことです。 同年代の海外の学生たちがデータサイエンスに対して抱いている思いや、それに臨んでいく姿勢、自身のキャリアに対する考えなどを聞くことで、自分たちがこれからどうやってこの分野で戦っていくべきなのか、そのために何をするべきかなど、改めて深く考えさせられました。 また、意見交換をした際、私たちは英語の能力が十分でなかったということ以前に、初対面の人に話しかけることを躊躇してしまい、インターナショナルな場で積極的にコミュニケーションをとることの難しさを痛感しました。このようなためらいを減らし、自分から積極的に意思疎通を図っていくことの大切さを感じました。 残る二日間、データサイエンスに関する知識やノウハウだけでなく、グローバル人材にとって必要な素養も学んでいけたらと思います。
Have you heard the term “analytics economy” and wondered what it means? Or maybe you’ve wondered how your organization can use data and analytics to achieve economic gains. Now we have more than just data. We have accessible data, fueled by advances in compute power and connectivity, and interpreted by ever-more powerful
In my 25 years at SAS, I‘ve noticed the continued use of important algorithms, such as logistic regression and decision trees, which I’m sure will continue to be steady staples for data scientists. After all, they’re easy-to-use, interpretable algorithms. However, they’re not always the most accurate and stable classifiers. To
Toe bone connected to the foot bone, Foot bone connected to the leg bone, Leg bone connected to the knee bone,... — American Spiritual, "Dem Bones" Last week I read an interesting article on Robert Kosara's data visualization blog. Kosara connected the geographic centers of the US zip codes in
‘세계자연기금(WWF: World Wildlife Fund)’은 미래 환경 보호 리더를 양성하기 위해 전 세계적으로 교육 프로그램에 투자하고 있습니다. 이 리더들은 세계에서 가장 다양한 생물들이 서식하고 있지만 취약한 지역의 환경 보호를 위해 앞장서고 있는데요. 실제 여러 리더들이 WWF의 지원을 받으며 혜택을 누리고 있습니다. 그렇지만 여전히 생물학적 보존 가치가 높은 지역에 거주하는 많은 이들이 자연에
The intelligence community needs to revamp its approach to analytics -- and that means creating an analytics strategy that will change the status quo. The challenges facing analysts are consistent throughout the strategic, operational and tactical levels of intelligence operations. The intelligence cycle (see diagram below) is a great teaching
You might not know it by looking at me (I’m rounding up when I tell people I’m 5’8”) but I’m a huge basketball fan. I’ve been following the sport since I was 10, coaching it for the last decade and playing on teams throughout my life, still dedicating my winters
The government has an unfathomable amount of data -- and it grows more and more each year. This puts agencies in a unique and important position to use that data for good. Whether it be improving government operations, solving some of the nation’s biggest challenges or empowering citizens in new
ODER: Wie erstelle ich ein Edge Analytics Case auf Basis von SAS ESP, SAS Streamviewer und eines Modelltrucks? Das beschreibe ich in Teil 3. Rückblick: Im ersten Teil wurden die Idee und der Inhalt der SAS Streaming-Analytics-Demo beschrieben. Im zweiten Teil sind die einzelnen technischen Komponenten sowie die Software aufgelistet. Im
This article shows how to simulate data from a mixture of multivariate normal distributions, which is also called a Gaussian mixture. You can use this simulation to generate clustered data. The adjacent graph shows three clusters, each simulated from a four-dimensional normal distribution. Each cluster has its own within-cluster covariance,
This is the final post in my series of machine learning best practices. If you missed the earlier posts, start at the beginning, or read the whole series by clicking on the image to the right. While post four in the series was about combining different types of models, this
Big Data se ha convertido en la nueva moneda de los negocios en el mundo. Transformar los datos en información, la información en conocimiento y el conocimiento en oportunidades, es una cadena de valor que la mayoría de las empresas en el mundo ya siguen, pero que en Colombia ni siquiera
Hasta hace un tiempo las estrategias de crecimiento de una empresa estaban basadas en el alcance de los productos, el fortalecimiento de la fuerza de ventas y en responder a los avances que pudiera introducir la competencia. El marketing era de lejos el rey en esta etapa. Después vino la
Every fall, highways, backroads and neighborhood streets nationwide take on a noticeable yellow hue, as school buses carefully and methodically transport students back to school. In some areas, including Boston, this massive transportation exercise can present a number of challenges. Boston Public Schools (BPS) provided transportation for 25,000 students via
With Hurricane Irma recently pummeling pretty much the entire state of Florida, I got to wondering where past hurricanes have hit the state. Let's get some data, and figure out how to best analyze it using SAS software! I did a bit of web searching, and found the following map
Phil Simon weighs in on the value of getting your own hands dirty using self-service data prep.
If you’ve got SAS running within your organization, which is likely considering that over 90 percent of the largest global firms have SAS, you’ve probably been hearing a lot about SAS® Viya™, which drives many of the latest enhancements of the SAS platform. But amidst all the talk about microservices
Did you know that you can get SAS to compute symbolic (analytical) derivatives of simple functions, including applying the product rule, quotient rule, and chain rule? SAS can form the symbolic derivatives of single-variable functions and partial derivatives of multivariable functions. Furthermore, the derivatives are output in a form that
No haber nacido en la era de la conectividad ha dejado de ser una excusa válida. Tendencias como las del Big Data, Analytics, Cloud o Mobile están redefiniendo los negocios en la actualidad y poco importa si se trata de un líder de la vieja guardia o si tiene la
It's time to share another tip about working with ZIP files in SAS. Since I first wrote about FILENAME ZIP to list and extract files from a ZIP archive, readers have been asking for more. Specifically, they want additional details about the files that are contained in a ZIP, including
As Hurricane Irma makes its way through the Caribbean, and heads towards the United States, the big question on everyone's mind ... is the hurricane going to hit my city? Or, as some people like to say, "should I buy milk & bread?" Let's analyze & map some data to
I suppose we've all been watching Hurricane Irma rip through the Caribbean like a giant buzzsaw blade, with wind speeds over 180mph. This is one of those rare Category 5 storms. But just how rare are Category 5 hurricanes? According to the Wikipedia page, hurricanes with wind speeds >=157mph are
If you use SAS regression procedures, you are probably familiar with the "stars and bars" notation, which enables you to construct interaction effects in regression models. Although you can construct many regression models by using that classical notation, a friend recently reminded me that the EFFECT statement in SAS provides
This is the seventh post in my series of machine best practices. Catch up by reading the first post or the whole series now. Generalization is the learned model’s ability to fit well to new, unseen data instead of the data it was trained on. Overfitting refers to a model that fits
Managing big data doesn't always mean hiring more people and buying new tools.