I'm the hiring manager on this role, I can take this one.
ML engineers fall between data science and production engineering. They're responsible for doing their own data analysis and experimentation, plus some light engineering to implement their models in production. We also hire backend engineers to handle the more hardcore infrastructure tasks, so we don't expect quite the same insane level of coding ability as some companies do.
I am from Java(Back end development) background with 4 years of experience. I am really interested to learn and work for machine learning projects. Do you think I should try here?
Can you please send me job description if possible, my email id is swapnil.javanjal@gmail.com
Reddit is hiring engineers in ads quality for our brand-new NYC office! We’re looking for machine learning and backend engineers, both junior and senior.
The ads quality team is responsible for all the machine learning that powers our ads backend, as well as the inference and ML serving infrastructure.
Our backend is in flux, but we’re using: Spark and Scala on AWS for machine learning, as well as BigQuery on GCP for analysis. Our ads serving stack is written in go and runs on AWS.
If you’re interested shoot us an email at jobs+hn@reddit.com
Hmm, that shouldn't be the case. If you try again and feel like helping us out, feel free to email the discrepancy to support@hipmunk.com.
Sometimes there is an explanation (e.g. the flight doesn't exist, or has a 30hr layover), and other times we don't have the flight, in which case we want to alert our providers to fix it.
First discrepancy I found was the first search I ran (LHR - SYD one way Mon 3/11) but I'll try to give a more helpful general case answer.
- Consolidators [that you don't have commercial relationships with] have wholesale arrangements with airlines allowing them to sell tickets below the direct/GDS price. I suspect you're more reliant on ITA results than many other portals or meta-search sites and their prices are trivially easy to beat on competitive routes if you have a broader range of partnerships.
- Some consolidators put together logical itineraries not bookable via GDS or even direct with the airline. As an extreme example I once bought a cheap, logical pair of connecting long haul flights with Saudia via eBookers that Saudia's website wouldn't allow me to book at any price direct with them; I'd guess the airline didn't want to market anything involving a 5 hour layover in an under-redevelopment terminal in Riyadh. There are plenty of less glaringly obvious potential indirect itineraries that consolidators (potentially including Kayak as well as their affiliates) can identify with Innovata's minimum connect time file and schedule data[1] and other providers might have a more liberal view on reasonable connections than ITA
- Sometimes other sites' metasearch results' advantages are purely illusory, based on outdated results they've cached or screenscraped. I find Skyscanner LCC prices are frequently out of date, even though the actual ticket price is also usually cheaper than ITA's. Other sites might also create the appearance of more choice by not combining codeshared flights into one.
As for LON-SYD on Monday Kayak has a lot more results under $1000 than you. It's possible their much cheaper ticket prices with SIA are outdated or geolocation specific, but the cheapest flight they show (a Garuda two-stopper that's no more obviously agonising than your cheapest result but 10% cheaper) doesn't show on your engine at all. Then again, the price with Kayak had risen to only 5% cheaper than your best price when I started the booking process...
[1]disclosure: I sell that data, amongst many other things, although not to OTAs
Flash as a technology is a caveman compared to modern Javascript-based technologies. There is no audience that does not have JS enabled -- other than the fringe who would probably be better off using wget to browse the internet.
We said exactly the same things about Flash not so many years ago, including the pretentious dismissal of anyone who would dare not support the technology we wanted to use. Before that (and to some extent even now) it was operating systems: what kind of hippie doesn't run Windows, after all? It's pretty easy to justify technical decisions by pigeonholing anyone not using our same platform as The Kind Of People who we don't want giving us money anyway.
There are quite a few comments here complaining about the gaudy unusable interface produced by "modern JavaScript-based techniques", which in this case appear to provide virtually nothing that static HTML and CSS can't do. The dev team was lazy and wanted to show off, and screw anyone who doesn't think running gobs of arbitrary code should be a necessity for buying a ticket.
And for what it's worth my non-developer artist girlfriend has been using noscript for longer than I have. I'll let her know what modern devs think of her, though I don't think she's heard of wget.
I actually deliberately didn't use Django specifically because it's so complicated for beginners (and for pros, IMHO). I was trying to avoid magic. Webapp2, which is the web framework GAE users, and what we used in the course, is about as simple as you can get without starting from scratch.
Hey Steve, thanks for your work in this course. It's a terrific thing for you to do and it's my favorite course to recommend to beginners to webdev.
That said, the webapp2 decision is by far my biggest hesitation in recommending it. Webapp2 just doesn't have the community support; whether django is too "heavy" or flask too "light", they're both popular, which means they have great support, plugins, help on SO, and a lot of developers familiar with them.
Webapp2 also hasn't seen a release since 2012[0].
I write this having recently completed a small consulting project where I helped a team on GAE move from Webapp2 to django. The productivity gains just from using django plugins and builtins was insane. They had been on webapp2 because their dev learned webdev on your course, and I think he would have been much happier had the course included a more widely used framework. Lock-in on Appengine is also a major issue; the work to move off can be enormous, especially if you're using a framework designed for GAE.
Again, I totally understand the decision to use webapp2 and the course is still a huge net positive. But if you ever think about revising it, I'd recommend considering another framework, and possibly even using Elastic Beanstalk.
EDIT: I understand django can take a little more to get up and running, but most people learning Python are going to have to learn it very soon anyway. I learned how to program by doing the django tutorial, and didn't find it burdensome at all.
Sure, in the right hands Django (and Rails) and yield major productivity boosts.
I personally don't like working in frameworks of that size because I find myself fighting them to do what I want.
For the course itself, I really wanted the students to get as close to the server as possible. Then, when some of them switch to the heavier frameworks, they'll have a better idea of what the framework is doing and what is possible. The course had a time constraint on it as well.
All in all, your anecdote reenforces my feelings on the topic :)
Steve, please know that I am forever grateful for your work in your excellent Udacity course. It began a path for me that recently led to me changing careers to become a full time programmer.
Whatever people might say about webapp2 and App Engine, I think they were the right choice for the course. I had no problem moving on to other frameworks or moving away from App Engine, having gotten a solid understanding of the concepts from your course.
Hi Steve, I'm the author of the path and a big fan of your course - thanks so much for putting it together! However I would like to echo @rattray's point - when I started learning as a complete newbie to web-dev, I found Django much easier to pick up, less intimidating and more satisfying than Webapp2. The Udemy course I have included later teaches how to put up a simple landing page using Django/bootstrap and in 3-4 hours - you have tried frontend, backend and all. The joy of making something real, even if simple is a big motivator and confidence-booster, which IMO is key to self-learning. I actually started with your course, drifted away in the middle, and then came back after finishing the Django course.
Hi Steve, thanks for teaching! I really enjoyed it. I'm not sure I misremembered it as Django; maybe I had confused it at the time with Jinja? I was quite a newcomer then (still am in some respects, but much less so). Which of course means that to a certain degree, my confusion was simply par for the course. I know it's difficult to write a curriculum that completely caters to thousands of students, and I want to emphasize that you guys did a great job. There were a few topics that I would have liked to have covered that I didn't come to understand until later. I guess you can always go deeper on a subject, though.
Steve -- Wow, I have to admit I'm a little star-struck by your comment here. Your course is easily the most awesome single resource I've found for learning web development -- thanks for your work on it!
At reddit we had two tactics for the frontpage problem.
The first thing we added was the "rising" page, which used to be reddit's default "new" page. The rising page was a weighted new page. It was a little hacky, but worked for a while. I see they've changed this, which I think may have been a mistake, but I haven't thought about it in forever.
The second thing we did, which worked really well, was to have an up-and-coming link placed at the top of the list on the frontpage. This helped those borderline posts get more visibility. I had a reddit simulator I used to use to test things like this. That space appears to be a subreddit search box for me, so it seems they've moved onto another solution.
Randomness is an interesting idea, though it might be a bit difficult to fit into the way we cached things.
I think the problem that the randomized algorithm solve is not missing good content. It is to give the same chance for equally good content - or at least to minimize the error due to page limit.
to have an up-and-coming link placed at the top of the list on the frontpage
Two things:
1) either the up-and-coming are a few borderline posts taken in order and then here you'd just extended the front page with a few more posts (ie. it changes nothing)
2) either you'd carefully choosen a heuristic for those posts which is pretty much what is suggested here.
Anyway, thank you for sharing your experience with this problem, it's very interesting.
An other idea would be to fix the content quality and make variable the number of posts in the front page.
Did you ever think of modifying "best" to apply to submissions? A measure of how quickly an article is rising would be a pretty good metric for determining its rightful prominence.
FYI - you can access most of ITA's routing language on Hipmunk, if you type :: after an airport code. The advantage of using us is we can make you a booking link to actually purchase the itinerary.
Thanks, that's interesting. The issue for me is that Hipmunk only serves the US market and won't let me enter multiple airports as source or destination. My best results have started with a list of 20 possible sources and destinations which is unfeasible to search with any other publicly available service.
Have you considered watching to see if a user searches for, e.g., flights out of JFK immediately after an identical search for flights out of LGA, and then gently highlighting the feature?
There're a slew of new flight booking startups like http://wuhutravel.com and http://sillk.co that I believe are trying to entice younger travellers by apparently launching features like you just mentioned raldi (based on screenshots I've seen on dribbble from their designers)...
"Displaying data in multiple computer windows" is an absolutely ridiculous patent.
It would indeed be ridiculous if someone had a patent on displaying data in multiple computer windows, but that's not what was patented.
Here's claim #1 -- the other claims are similar in their specificity:
1. A system for synchronizing the presentation of data on a digital computer display, comprising:
first and second window-controlling means, each of said window-controlling means displaying information in at least one display window,
a synchronization control means, and
means for communications between said synchronization control means and each of said first and second window-controlling means;
wherein said first window-controlling means displays first information from a first source, and said second window-controlling means displays second information from a second source;
wherein said first information and said second information have sections, and at least one section of said second information corresponds to a section of said first information;
wherein said first window-controlling means sends a message to said synchronization control means over said means for communications indicating a change in viewing position to a new section of said first information;
wherein said synchronization control means sends a message to said second window-controlling means over said means for communications requesting a change in viewing position to a section of said second information corresponding to said new section of said first information; and
wherein said second window-controlling means displays said corresponding section of said second information on at least one display window.
Remember, patents cover methods, not purposes. This patent covers a specific method for displaying data in multiple windows, not the general concept of displaying data in multiple windows.
Mind you, I still think it's a dumb patent. But let's make sure we understand what we're lynching them for before we get out the torches and pitchforks, ok?
I wrote a split screen sprite editor for dos about 20 years ago. I bet it's still on a 720 floppy some place in my basement. In fact, I think I have a source printout filed away too..
>> But let's make sure we understand what we're lynching them for before we get out the torches and pitchforks, ok?
Have you ever used Hipmunk? It's user friendly and all, but it's not rocket science in the 50s. I think it's fair to assume that any patent they're violating is probably bullshit without too much research.
method and system for synchronizing the presentation of
data from different, but related, sources in different
windows of a computer display
It looks like this is person patented having multiple windows display related data in an organized fashion. Just as ridiculous, but differently so! And almost as vague.
Not seeing any specific numbers in the complaint after a quick scan of it. Did they state in the letter (which wasn't in the link) the amount they are seeking for the license?
Reddit is hiring engineers in ads ML for our brand-new NYC office. We’re looking for both junior and senior ML engineers.
The ads ML team is responsible for all the machine learning that powers our ads backend, as well as the inference and ML serving infrastructure.
Our backend is in flux, but we’re using: Spark and Scala on AWS for machine learning, as well as BigQuery on GCP for analysis.
Our ads serving stack is written in go and runs on AWS, and our ML includes logistic regression, GBDT, NLP embeddings, recommender systems, and more.
If you’re interested, shoot us an email at jobs+hn@reddit.com