OSCON: What it is.

This is the first of a few articles I’m going to write over the next couple of weeks related to the O’Reilly Open Source Conference, or what is more colloquially called OSCON. Before the conference event topic let us sync up on understanding exactly what this conference has been, what it was intended to be, and what it is today and its roots in open source.

OSCON was inaugurated in 1999 with its first conference held in Portland, Oregon. The location, that generally, has been the accepted home of OSCON. There have been other OSCON events in other locations but the sentiment remains – OSCON is a Portland conference and it’s a bit rough going in other cities hosting the conference.

OSCON started as a conference centered around the open source community since day one. It’s consistently held that course even when open source was regularly lamented, insulted, and cursed by the software industry. At one point Microsoft, the biggest of big software companies in the early days of OSCON relentlessly attacked open source. Steve Ballmer stated, “Linux is a Cancer” back in 2001.

Jim Allchin attacked open source as “the worst”.

Even the founder Bill Gates even went on record saying open source would make it so, “nobody can ever improve the software”.

Microsoft execs weren’t the only ones, just some of the richest, prominent, and loudest about berating the licensing model. Many corporations and others attacked it as communist and in other ways. But OSCON continued onward every year with solid turnout in Portland. The community continued to grow. But considering where we are now, that might seem a bit obvious. But way back then it wasn’t so obvious that open source licenses and related open models would become the way a vast percentage of software would be developed, as it is today.

But here we are!

OSCON started around those earlier days when open source was more often maligned than celebrated. At least in the business world and in the places the vast majority of us were, or would have been employed. When it started the conference aimed high and achieved a lot of victories in bringing together key people within the industry to grow open source development from multiple angles. As time went on OSCON expanded, as did its host library of open source books, on all the tools, options, and available solutions that were being created via open source licensing and the plethora of development paradigms.

Fast forward to today and OSCON is still that stalwart conference that brings people together, from those early days, to people that have just joined the open source communities today. This cohesive gathering of minds has a very low barrier for entry with its hallway pass, all the way to standard – more expensive fare – that covers the whole conference, specific and special gatherings, presentations, demos, and related activities.

Stay tuned, subscribe to the blog, and my next post I’ll take you on a whirlwind tour of more OSCON events, The New Stack‘s birthday at the conference, and more.

TRIP REPORT: Accelerate 2019 in Washington DC, I mean National Harbor!

Trip Time.

Today’s trip care of Alaska Airlines Flight 2 out of SEATAC Airport (Seattle & Tacoma’s airport) to National (Reagan) in Alexandria, Virginia. I’ll be staying there and commuting daily across the Potomoc River to Gaylord Resort and Convention Center (at National Harbor). I decided I’d write up something about this trip for a few specific reasons:

  1. I finally purchased a Bromptown Bicycle which I’ve been wanting to attain and use for my trips that require air travel or don’t have enough space for a proper bicycle.
  2. The adventure is entirely new to me, I’ve not been to these locations at any point in my life. New for me, new for those reading this (or adventuring along with me on my Twitch channel).
  3. I also picked up a number of new things that I want to see how they’ll work for streaming while on the go. These include; Android Phone, a new dual Go Pro + Phone mount for the bike, and among these a few existing devices like my trusty set of GoPro Cameras.
  4. I flew over via first class for various reasons. I thus, wanted to share some of the advantages and why I think it’s more than worth it to fly first class vs. coach and why companies should rethink their ideas around this when positions require frequent travel and working on the go.

Leaving Cascadia

The first thing I did was pack up the Brompton. I got a hardshell case to go along with it since I’d read during my research the airlines sometimes will snap off parts of the bike when a softshell case is used. The other advantage, the hardshell case has wheels! Inside this I also put my front mount messenger bag and some bungie cables so I can mount this stuff up to the bike upon arrival.

Once that was packed it was time to get the Mission Workshop ARKIV backpack I have locked and loaded. In my pack, which is the large of the two sizes, I get all my cloths, toothbrush, razors, and related amenities. In the side pouches that I mount up specific for longer trips I put my power brick and other electric plugs I’d need regularly in the quickest to access pouches. The other things go in various assorted pockets here and there. Since this is such a short trip, I also skip the outer backpack laptop pouch and just put the laptop in the inner sleeve.

Stats

Backpack w/ Laptop: 22 lbs. (with laptop)
Hardshell w/ Bike: 32 lbs.

All in all, a fairly heavy load, but the cool thing is with the configuration and post-arrival setup I have there isn’t actually much to carry. Backpack goes on my back and the hardshell case rolls along like a carry on. What makes it even easier, I’ve got an express bus with plenty of space and light rail with special areas specific for luggage like this. My 17x Express arrives on time, I board and ride off with my pack and hard shell sitting right next to me.

When I arrive downtown I merely pack up and roll downstairs to the Sound Transit LINK, board the train and off to the airport I go. No need to mess with a driver, no need for chatter or worrying about the implications of social anxiety or evils of clicking “don’t talk to me uber driver”. Just board and go. Then, read a book, check your phone, or whatever comes to mind. That’s what I do.

At the airport I strolled and rolled into the first class lounge, which I attempted to record via my new Android with the Twitch app. It… went oddly I’m assuming. Let’s take a look here.

Once I got situated in the lounge I made some pancakes – a tradition I have now – and sat down for some coding. The seats are comfortable, the views are great, and along with the coding I get to nerd out on all the planes taking on and off. At least, when one is flying in and out of C Gate at SEATAC. N Gates are kind of “meh”.

Eventually I left the relaxing lounge and headed into the boarding area of C Gates. The Alaska Air 737-900 arrived and started deplaning. With deplaning, boarding, and refueling done for the trip back east to DC we headed back out on the tarmac to queue up 15th in line to take off. Check that out, total plane traffic jam!

IMG_20190520_140122

Once in the air we flew through some piddly turbulence and into more clouds. Clearing 10,000 foot laptops came out and a little bit more coding resumed. In addition I started this post, took a few pictures, and knocked out a few other things I needed to do.

After a while food and drink services began. In first class anything over an hour can safely assume a meal will be served. This time it was tortellini or a sandwich of some sort. I got the tortellini. The meal is then served in three parts. Starting with a little salad and soup, entree, and then wrapped up with a desert.

The soup was tasty, I was somewhat surprised by this. Where as the salad was merely a salad with some cherry tomatoes, carrots, and greens. Nothing real special, but then of course it’s a salad so not like there’s much expectation.

The tortellini was pretty good. Even in comparison to other food outside of the airlines. A little salt and pepper brought it up just slightly to something I’d even have been happy with in an actual restaurant!

Finally we wrapped up with some Salt & Straw for desert. Considering this is an airplane I was kind of amazed they’d get Salt & Straw, but then again, Alaska Airlines does like to play to the local products and all!

After food, a couple more hours of coding and prep for the oncoming days of Accelerate.

Arrival in the District of Columbia

I arrived in DC, retrieved my Brompton and racked up the case it packs in and threw my bag on the front. Now for a 26 minute bike ride from the airport to Alexandria.

the-path.png

On the way, the setting was magnificent with honey suckle providing a divine fragrance while I road along the bike trail along the Potomac River. The moon shined down, almost full, and in spectacular fashion!

Eventually I arrived at my new home for the week. The ride a success, an experiment that it was.

Bootcamp!

NOTE: I am an employee at DataStax, just so you know, in case you didn’t know. I always do my best to give you the direct details, but just so you don’t think I’m being a shill here. Some people don’t seem to be able to determine how people and occupations are correlated, so I like to keep things on the up and up.

First day, or maybe it’s zero day on account of zero based indexes and all, bootcamp kicked off!

In the boot camp we covered a lot of material to get attendees up to speed on Apache Cassandra. To boot, Patrick McFadin announced that everybody would get to use DataStax Constellation, our new Cassandra as a Service offering – currently in test. The awesomeness about this whole bootcamp was that we provided Constellation for everybody, without a blip on the radar! No system issues came up, albeit we crossed a few programmatic network wires that were crisscrossed but that got remedied in seconds. With that all wrapped up, released, with a bow on top, bootcamp went off without a hitch. Also a huge shout out to the dozens of team members that provided support throughout the room of 300+ attendees!

Good times in success!

Day 1 – Announcing DataStax Constellation

The first day, based on our zero based index numbering of conference days, started with Billy Bosworth CEO of DataStax giving keynote number one.

In the keynote Billy talks about the direction of DataStax and the upcoming releases, and current releases as of Accelerate 2019. Then Chelsea Navo joins Billy to do a LIVE – emphasis on a LIVE demo of DataStax Enterprise (i.e. Apache Cassandra and all the goodies) running multi-cloud in Azure, AWS, and GCP.

9:23 – Demo of DataStax Enterprise – Multi-cloud in real life. “Not a pretend demo!

15:17 – Chealsea shows how we introduced a little chaos into the mix, and introduces the ability to simply and easily bring a datacenter down. In realtime, as the related reads and writes are occurring. Nothing stops, not even a blip… whoops, did I spoil it? Give it a watch, it’s a solid keynote demo!

At the 20 minute mark, Billy introduced DataStax Constellation. Watch it, learn more, etc. Following that Billy talks about Insights, which will be built in and services based AI, system health, and related capabilities within the cloud offering.

After the keynote, everybody broke out into technical sessions on a wide, very wide range of topics. From Apache Cassandra to DataStax to Kafka to Vue.js! Great day!

Day 2 – Apache Cassandra v4.0

On day two Billy starts off the keynotes, and introduces others including Nate McCall. Nate is the Apache Cassandra PMC Chair & committer to the project. He dove into the new features, capabilities, and changes of v4.

Next up is DataStax CTO (and founder!) and Apache Cassandra committer of yore, and more, Jonothan Ellis! (video is time point linked below so you can dive right into the talk).

After the keynotes more technical sessions. I attended some architecture discussions around graph and related technology. Lots of good conversations. I really enjoyed it, and to wrap it all up that evening we had an ending keynote with Keren Elazari.

Departure

I had a great time, and as I always like a little lagniappe. Here’s some photos from the trip back. If you’d like to join DataStax Accelerate for 2020, give a good look at the upcoming conference next year!

 

TRIP REPORT: O’Reilly Velocity & Software Architecture Conf 2019

This past week the O’Reilly Velocity and Software Architecture Conferences took place. I’ve attended both before, the 2nd time for Velocity in San Jose and the 2nd time for Software Architecture, however this time in San Jose and the last I attended was in London. The locations for these conferences dictate much about what is presented and how conversations, meeting and interacting, learning, and explorations take place during the conference, but more on those specifics in a moment.

The overarching theme from keynotes and many of the conversations I had met on a few key topics:

  1. When you’re building software, and you want to do it well you first and foremost must, absolutely must, invest in the people building your software.
  2. Focus on simplicity, remove complexity at every opportunity.
  3. Organizational structure can have direct impact in the complexity or simplicity of software, structure your organization efficiently and make every effort to keep it simple.

TLDR; Keep your people happy, focus on simplicity, minimize organizational noise from bureaucracy.

Topically Elaborating on Edge and Serverless

Ok, so a number of conversations came up around edge computing and serverless. Both interesting, but it also seems like there isn’t a strong play for the Enterprise is either space just yet. At this point however, a lot of enterprises are struggling with their Kubernetes, Cloud, and Hybrid solutions enough as it is that they haven’t even broached the edge compute and serverless realm. But that hasn’t stopped a lot of forward thinking individuals to start tackling how to cut out a useful spectrum of application with both edge and serverless.

Serverless Oh My!

Serverless, like so many other names, is kind of a garbage name at first. It effectively tells us nothing useful. It’s a word that requires more words just to give it meaning. It’s kind of like if I said “I like food!” What does this even mean? So, everybody eats food and most people like it, so what does “I like food!” actually mean? Same with serverless, because the first thing it doesn’t mean is a system that has no servers. What it generally means is something about your applications and you not doing anything with servers. It’s the mythic NoOps by simply removing the computers somehow.

Serverless, or simply code that executes via compute, and you get a result, has actually been around for some time. There were a number of startups that were well ahead of AWS Lambda and the other respective implementations that Azure and GCP have. These startups had been attempting to usher in what it took AWS’s massiveness and clout to actually get people to pay attention to. Serverless however has gone far beyond merely Lambda at AWS and now we’ve got to contend not only with the option in the existing cloud providers but where, how, and when can we get it into our data centers! The TLDR is that enterprise wants serverless and they’re interested in throwing it onto Kubernetes or whatever they’ve got. But often the infrastructure and systems to really make use of this simply isn’t there.

Most of the conversations I had evolved around the who, what, where, when, and how do we make use of these options for what we do? This is where most companies, at least enterprise and large companies, currently seem to be in the market. Then there are the companies that have already made the leap and are doing all sorts of stuff with Lambda and related serverless offerings. The gulf, that middle ground, doesn’t seem to have been broached by many others. Everybody, anecdotally of course, seems to be either trying to figure out how to start or already made the leap!

Edge Compute

This kept coming up, regardless of how or what people defined it as, it came up as something a number of people were very interested in. This notion, loosely based around using edge devices; smart phones, IoT devices, your car, or your washer for example, it could be almost anything. These devices do compute on the edge and thus the term. However it’s interesting because it isn’t like, for example, cloud computing that has core features like compute, storage, and related elements. Edge computing can run the gamut of any device doing any kind of work and the related capabilities of that device. It kind of leaves the space wide open. However, there were a few focal points that kept coming up.

The most common topic that came up around edge computing was doing tasks at point of presence. Such as having a phone do facial recognition, computing path finding (i.e. traffic directions), and related compute on the device versus round tripping it back to the cloud. It almost seems like after all these years of pushing things to the server we’re really starting in earnest to bring smart processes and tasks back to the devices we have in hand – no pun intended. It’s an interesting space, interesting paradigms, and I’m still not ready to call a specific thing within the world of edge compute and say, “that’s the next billion dollar idea”. Largely because, there are a lot of billion dollar ideas out there these days.

Speaking of edge compute and serverless, my fellow DataStaxian also had a few of these conversations on said topics. Patrick wrote up a post on a few observations over on the DataStax blog “Velocity Conference Shows What’s Gaining Velocity in Data Management“.

Geographic Location

As I mentioned, this set of conferences is in San Jose, the home of Silicon Valley, but the southern segment of the area. It’s a walk-able area with a number of places to break out from the conference and really dig into the hallway tracks (i.e. impromptu conversations!) that come up. For those willing to jump on the light rail, or scooter around, San Jose opens up even more to the local area providing a wide variety of coffee, food, and other operations to share conversations over.

All in all, the geographic location for the event is solid, being in the center of the city where it is. However one issue did arise, the Marriott lost power as an electrical fire in the control room of the multi-story hotel blew out the power. At last I checked upon leaving, it still didn’t have power! With the temperature at 105f going on multiple days at this point, the hotel because extremely hot inside, and being a kind of sealed airspace the air calculators also weren’t refreshing the air. That left a number of guests in less than stellar condition to attend, let alone attain value, from the conference events. Myself I ended up checking out in short order, getting sick the last day of the conference anyway, and being unable to provide the presentation that I had paired up with Lena (@lenadroid) for! I’ve been thinking, that maybe she and I can provide an online version of it for those that had wanted to hear us present on “Flexible Cloud Architectures: Decision Making Best Practices“.

Next year’s Velocity looks like it’ll be in Santa Clara, which doesn’t really excite me as it’s kind of a nebula of sprawling suburbia of boredom. This is were location becomes fundamental to what will or what can be the potential of secondary and tertiary conversations at a conference like this. Don’t get me wrong, the hallway track is excellent, but having options to step out and walk across the street from the event to converse further adds a tremendous value.

Santa Clara simply doesn’t do that unfortunately.

The fortunate thing between now and then, albeit the conference is moving to Santa Clara, they’re having subsequent conferences in the Velocity series in Berlin, and Software Architecture Conf series in the amazing cities of New York and Berlin. Those locations are worth traveling to for far more than a conference, increasing my interest in attending both of those future events. I’m looking forward to these!

Twitter Talk @VelocityConf

From @DataStaxDevs a thread! Click through for all ~17 parts.

Some Build Engineer Work – Click through for the whole construction thread.

Some of the Keynote Threads

Alena Hall – @lenadroid

Jessica Kerr – @jessitron

…and there were a bunch of others too, solid, check out the hash tag of #velocityconf to read up on more.

The Lagniappe

After the conference I finally managed to pick up a pocket Constitution.

If you’re ever in search of good coffee in San Jose, one place I found that’s tops is Academic Coffee, both the coffee and service are great. Good jovial crew and lots of cyclists in and out.

Making progress on the CaSMa, tweeted a bit on the topic while en route to the conference. If you’d like to get involved, please do let me know!

Other arbitrary statistics:

  • Stickers collected: 11 unique, ~7 of each. Total: 77 stickers.
  • T-shirt Swag: 2.
  • Conversations @ DataStax Booth: 11
  • Hallway Track Conversations: 7
  • Coffee Consumed: 9 over 3 days.
  • Twitter Filters Discussed: 123.
  • Fuel burned to compensate for electrical fire damage for the time of the conference: Approximately 5k gallons of fuel for the Marriott Hotel and no idea how much more fuel was or is still being burned to power the hotel.
  • Times the power still went off even with the diesel engine power trailer attached: 4.

Lots of Events & Topical Tech Discussions

This week we just had Ryan Zhang present at the Seattle Scalability Meetup. I did a little short presentation just showing some tools that I’ve been using as of late; DataGrip, and related schema migrations and Docker containers as I work through the schema migrations. It was a solid meetup and excellent conversation after meetup, big thanks to everybody who came out to the meetup and joined us for a round of drinks, amazing cheese curds and hummus at Collin’s afterwards! It was a great meetup and looking forward to getting together again on May 28th with Guinevere (@guincodes) presenting “The PR That Wouldn’t Merge“!

In other upcoming events that I’ll be at either presenting or attending. The events I’m attending let’s get talk, I’m always interested in meeting new people and learning about you’re working on, what you’re learning, and where and what efforts are of interest to you. For the events I’m presenting at the same applies, plus I’ll be standing among all the persons and presenting whatever tidbit of knowledge I’ve come to present. Hopefully it’ll be useful and informative for you and we can continue the conversation after the presentation and we all gain more insight, ideas, and ways to move forward more productively with our respective efforts. Here’s a list of the next big meetups and conferences I’m either speaking at or attending, and hope to see and meet many of you dear \m/ readers there!

Zhi Yang Presenting “Hierarchical Topic Modeling in Cancer Patients’ Mutational Profiles”

UPDATED: Video Added from the Conference!

zhi-yang.png

Introducing Zhi Yang > @zhiiiyang < presenting “Hierarchical Topic Modeling in Cancer Research”.

Topic models have been widely applied to extract topics from various range of documents or collections of texts, i.e., online customers reviews, medical records, scientific
journals, legal documents, books and etc. Its application facilitates the process for us to quickly understand the most featured and commonly shared information embedded texts without actually reading through the entire collection. In addition, topic models also allow us to access the contribution of each topic and its representations across different documents. Human genomes have been exposed to an assortment of mutational processes by contributing to unique patterns of somatic mutations. What would happen if we apply the same concept to the somatic mutations obtained from the cancer patients and look for “topics” of mutations? What would these “topics” tell us about the most important information for our health, genetic, risk factors for cancer and
something more that slip under the radar?

Shiraishi et al’s have proposed a topic model targeted for somatic mutations to capture the characteristics and burdens contributed by mutational processes. By closely examining the burdens, we’d like to compare them across different categories, say, for example, time, cancer subtype, ethnicity, smoking history, etc. Then, we’d like to develop the statistical machinery to infer the difference between the mutational profiles across different categories and associate the variations with the know exposures. This tool is potentially useful for identifying novel and existing mutational processes and correlating them with risk factors in which later can be used to monitor any treatment effects in personalized medicine and targeted therapy.

Read the publication here at biorxiv and come check out Zhi Yang’s talk at ML4ALL happening April 28th-30th in amazing Portland, Oregon! Get your tickets to attend here. For the schedule, our excellent sponsors docs for the conference, check out the ML4ALL Conference Site!