SUMMER 2010 REU University of Illinois at Chicago (UIC)

Research Experience for Undergraduates in Computer Science

Sponsored by The National Center for Data Mining at UIC & The National Science Foundation

The National Center for Data Mining (NCDM) at UIC is hosting a Research Experience for Undergraduates (REU), Summer 2010.

Interactive Systems Biology App: Museums and Beyond Project

A team will develop an iPod Touch or iPad interactive game, “Make a Fly.” The app will combine transcriptional network data with images of embryo/larval/pupae biology to create a dynamic 3-D “Google-Earth” like semi-transparent information landscape. The learning experience will be goal-based and spawn a successfully emerged adult Drosophila mealangaster fly. The colorful app, after formative development and testing, will run on an iPodTouch or iPad tablet in the Chicago Museum of Science and Industry’s Genetics: Decoding Life exhibition. The user will guide the virtual animal’s developmental pathway by moving the pod forward in three dimensions to accelerate biological growth. Keeping the user’s activities aligned with normal regulatory events portrayed on the screen, in the absence of bumping into chromosomes and mutating critical genes or altering important temporal/spatial environmental signals, will result in winning the game.

Program Leaders: Dr. Robert L. Grossman (UIC) and Dr. Barry Aprison (UofC)

Dates: June 7 to August 2, 2010 (Shorter periods may be available)

Location: University of Illinois at Chicago in Chicago, Illinois

Applications received before May 1, 2010 will receive first consideration.

REU 2010 Announcement
REU 2010 Application

Eligibility and Prerequisites: Open to undergraduate students who are U.S. citizens or permanent residents. Computer science prerequisites: Networking, Operating Systems, C++, Python. (iPhone/iPod Touch OS 3.1; iPhone SDK 3.1.2; iPad/iPhone SDK 3.2beta).

Support: Participants will receive a $250 a week stipend and a travel subsidy of up to $400. Campus recreation facilities will be accessible to students in the program. Dormitory housing may be available through the program.

Please email your application form to: reu@ncdm.uic.edu

*Include “REU 2010 ” in subject line of all emails*

Contact person:
Shirley Connelly
322 SEO, MC 249
851 South Morgan Street
Chicago, IL 60607
Tel: 312-413-2176
Fax: 312-355-0373

Posted in Uncategorized | Comments Off

SC09: UDXnet BWC visualization

NCDM developed UDT high performance network protocol demonstrated across a 12,000 mile network at SuperComputing 2009 conference. This demonstration shows output of udx command line program, as it quickly scales up to over 8Gbps on the 12,000 mile network.

video

click to play video

Posted in SC, UDT | Comments Off

SC09: Canopy visualization

NCDM developed virtual network management library Canopy, demonstrated at SuperComputing 2009 conference. This demonstrations shows use of Canopy system to switch between two sets of Web and SQL virtual machines. As Web and SQL servers are alternated, database web interface updates visualize which resources are accessed. All Canopy operations demonstrated took place on osi layer 2 and without signaling the web and sql instances.

video

click to play video

Posted in SC | Comments Off

SC09: Sphere TeraSort performance visualization

NCDM developed Sphere platform demonstrated at SuperComputing 2009 conference running a TeraSort HPC benchmark. LAC Cluster Monitor is used to visualize compute node utilization during the Sphere TeraSort run. Each square represents a single node and it’s color indicates system load.

video

click to play video

Posted in SC | Comments Off

SC09: NCDM wins the Bandwidth Challenge Competition.

The NCDM/iCAIR/NRL team demonstrated three applications to show efficient bandwidth utilization in distributed data intensive applications. The first demo is processing very large datasets over 256 servers in 4 data centers connected by wide area high speed networks. The data analysis application exchanges data at over 100Gb/s among participating nodes. This application uses the open source software Sector/Sphere and UDT, developed by NCDM. The second demo is a cloud based image rendering application that delivers very high resolution visualization (computed by remote cloud systems) over long distance infiniband and IPv6. A hardware implementation of UDT was deployed to support the long distance infiniband protocol. The third demo showcased a light weight UDT variant called UDX, which can transfer data at 9.xGb/s using a single connection over a 10Gb/s network with 200ms RTT. Overall, our team achieved 25Gb/sec sustained throughput over a 200ms RTT, 12,000 mile path utilizing only seven servers at the SC09 floor.

Final Results of the SC09 Bandwidth Challenge:

Category – Rich
- manifold-process implementations including diverse mechanisms.
Winner: National Center for Data Mining/Univ of IL at Chicago
Category – Classic data movement.
Winner : Caltech
Category – Impact
- developments strongly affecting the target communities.
Winner: University of Tokyo
Category – Overall
Winner: National Center for Data Mining/Univ of IL at Chicago

Posted in SC, Sphere, UDT | Comments Off

UDX high performance protocol announced.

Nov. 17, 2009 Portland, Oregon

Today, at the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC09) conference, a research consortium demonstrated a new high performance transport protocol, UDX, for data intensive science applications. The UDX demonstration transported 9.3-9.6 Gbps streams using US national 7,000 mile testbed, C-Wave, based on lightpath channels within optical fiber.

Full Announcement (pdf).

Posted in SC, UDT | Comments Off

WTTW 11 Chicago presents a segement on Cloud Computing

‘Chicago has become a world center of “cloud computing.” As we continue our Chicago Matters: Beyond Burnham series, Rich Samuels explains what “cloud computing is and how you probably already use it on a daily basis.’ Video link.

Posted in Uncategorized | Comments Off

Sector: An Open Source Cloud for Data Intensive Computing

Presentation video from CloudSlam09.com

YouTube Link

Posted in Sector | Comments Off

ACM SIGKDD 2007 Service Award to Robert Grossman

Award Acceptance Video

ACM SIGKDD is pleased to announce that Robert Grossman is the winner of its 2007 Service Award. Robert Grossman is recognized for his key role in the development of open and scalable architectures and standards for the SIGKDD and Global KDD Communities.

The ACM SIGKDD Service Award is the highest service award in the field of data mining and knowledge discovery. It is given to one individual or one group who has performed significant service to the data mining and knowledge discovery field, including professional volunteer services disseminating technical information to the field, leading organizations or projects that contribute technically to the field as a whole, furthering KDD education, or increasing funding to the KDD community.

The previous SIGKDD Service Award winners were Gregory Piatetsky-Shapiro, Ramasamy Uthurusamy, Usama M. Fayyad, Xindong Wu, the Weka team lead by Ian Witten and Eibe Frank, and Won Kim.

The award includes a plaque and a check for $2,500, to be presented at KDD-2007 (The 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining) Opening Plenary Session on August 12, 2007 in San Jose, CA.

Grossman was one of the Founders of the Data Mining Group in 1998, which develops the Predictive Model Markup Language (PMML). He has been its Chair since it was started; and, during this time, it has released nine versions of PMML. PMML has seen wide spread adoption by the KDD community, in part, because:

  • PMML supports the sharing of statistical and data mining models in a platform and application independent fashion.
  • PMML supports architectures in which one application produces PMML models (called the PMML Producer) and another application, which may not even be a data mining application, consumes PMML models (called the PMML Consumer or scoring engine).
  • PMML supports KDD service oriented architectures.
  • PMML facilitates the storing of models in model repositories.
  • PMML supports applications in which models must be audited for compliance and other regulatory requirements.

For the past 10 years, Grossman has led two international testbeds for high performance and distributed data mining, which have been used by over fifty different organizations and groups to test, benchmark, and develop innovative technology for high performance and distributed data mining and knowledge discovery. The testbeds have also been used to develop and benchmark grid and service oriented technologies for mining large remote and distributed data sets. The first testbed was called the Terabyte Challenge and operated from 1995 to 1999, when working with a terabyte of data was still relatively rare. The second tested called the Teraflow Testbed was started in 2004 and will operate until at least 2008. Today when most distributed data mining takes place at 1-100 Mbps, the Teraflow Testbed can be used to mine data at 1-10 Gbps over wide area high performance networks.

Grossman has a long history of serving the KDD community. He was the Industrial Track Co-Chair for KDD 2006, the General Chair of KDD 2005, the Sponsorship Chair for KDD 2000 and 2001, and the co-chair of the First and Second SIAM International Conferences on Data Mining (SDM-01 and SDM-02).

Grossman has published over 140 research and technical papers in international conferences and journals. In 2005, he led the team that won the first annual High Performance Analytics Challenge at the ACM/IEEE International Conference for High Performance Computing and Communications (SC 2005). He also led teams that won prizes involving high performance data mining and related areas at SC 2006, SC 1999, and SC 1998, SC 1996 and SC 1995.

Grossman is the Director of the National Center for Data Mining at the University of Illinois at Chicago and the Managing Partner of Open Data Group.

ACM SIGKDD is pleased to present Grossman its 2007 Service Award for his significant service and contributions to the global KDD community.

2007 ACM SIGKDD Awards Committee:

Ramasamy Uthurusamy (General Motors, USA), Chair
Jerome Friedman (Stanford University, USA)
Jiawei Han (University of Illinois Urbana-Champaign, USA)
Vipin Kumar (University of Minnesota, USA)
Heikki Mannila (University of Helsinki, Finland)
Rajeev Motwani (Stanford University, USA)
Ramakrishnan Srikant (Google, USA)
Ian H. Witten and Eibe Frank (University of Waikato, New Zealand)
Xindong Wu (University of Vermont, USA)

Posted in KDD | Comments Off

Sterling Commerce adopts UDT

Sterling Commerce, an AT&T Inc (NYSE: T) company, today announced Sterling File Accelerator (SFA). SFA combines the power of the company’s Connect:Direct point-to-point file transfer software optimised for high-volume, secure, assured delivery of files with a new UDP Data Transfer-based file transport (UDT) – an application-level data transport protocol that overcomes the latency issues associated with transmission control protocol (TCP)-based transmissions.

source: iTWire.com.

UDT was developed by National Center for Data Mining

Posted in UDT | Comments Off