Encord Blog
Meet Mavis - Product Designer at Encord
Written by
Lavanya Cholaraju
View more postsLearn more about life at Encord from our Product Designer, Mavis Lok!
Mavis Lok, or ‘Figma Queen’ as we’d like to call her, thrives in using innovation and creativity to enhance the user experience (UX) and user interface (UI) of our products.
She listens closely to our customers’ needs, conducts user discovery, and translates insights into tangible and elegant solutions. You will find Mavis collaborating with various teams at Encord (from the Sales and Customer Success teams, to the Product and Engineering teams) to ensure that the product aligns with our business goals and user needs.
Hi, Mavis, first question is what inspired you to join Encord?
When I was planning the next steps in my career, I knew that I wanted to join an emerging and innovative tech startup. In the process, I stumbled upon Encord - with a pretty big vision of helping companies build better AI models with quality data. A problem that seemed ambitious and compelling.
I had my first chat with Justin [Encord's Head of Product Engineering], and he gave me great insights into the role, the company, and the domain space, which tied nicely with my design experience and what I was looking for in my next role.
I was evaluating many companies, and I made sure (and I'd recommend to anyone reading!) to speak to as many employees from the company I could meet. The more people I met from Encord, the more and more eager I became to join the team.
Could you tell me a little about what inspired you to pursue a career in product design?
Hah, great question! I was previously in creative advertising and was trained as a Creative/Art Director. During my free time, I would participate in advertising competitions where I would pitch ideas for brands, and I’d always maximize my design potential through digital-led ideas.
That brought me to work as a Digital Designer and then as a Design Manager, where I got my first glimpse of what it was like to work closely with co-founders, engineers, and designers. The company I was working at, was going through a transition from an agency to a SaaS type business model, and I found many of the skills I'd developed were actually an edge for what product design requires.
Having an impact in balancing business needs, and product development challenges whilst creating products that are user-centric and delightful to use - is why I love what I do every day.
How would you describe the company culture?
I think the people at Encord are what sets us apart. With a team of over 20 nationalities, it’s an incredible feeling to work in an environment where diversity of thought is encouraged.
The grit, ambition, vision, and thoughtfulness of the team are why I enjoy being part of Encord.
What have been some of the highlights of working at Encord?
Encord has given me the space to throw light on the impact that design can bring to the company and build more meaningful relationships with the team and, of course, our customers.
Another big highlight for me is practicing the notion of coming up with ideas rapidly whilst being able to identify the consequences of every design decision. Brainstorming creativity whilst critically is something I hold dearly in my creative/design life, so it’s definitely a highlight of my day-to-day at Encord.
On a side note, Encord is also a fun place to work. Whether it is Friday lunches, monthly social activities, or company off-sites, there are plenty of opportunities to have a good time with the team.
Lastly, what advice would you give someone considering joining Encord?
The first thing I would say is you have to be authentic during the interview, and you should also genuinely care about the mission of the company because there is a lot of buzz around the AI space right now - genuine interest lasts longer than hype.
I would recommend reading our blogs on the website; it's a great place to start, as you can gain a lot of insight from it. From learning more about our customers, to exploring where our space is headed.
We have big plans for 2023 & are hiring across all teams. Find here the roles we are hiring for.
Build better ML models with Encord
Get started todayWritten by
Lavanya Cholaraju
View more postsRelated blogs
Top 10 Data Annotation and Data Labeling Companies [2024]
With increasing reliance on computer vision (CV) systems in multiple industrial domains, the demand for robust data annotation solutions is rising exponentially. The most recent reports project the data annotation tools market to have a compound annual growth rate (CAGR) of 21.8% from 2024 to 2032. However, as several companies emerge offering annotation platforms and services, finding a cost-effective provider is challenging. While many platforms offer advanced annotation features, only a few meet the scalability and security requirements essential for enterprise-level CV applications. This article discusses the ten best video and image annotation companies in 2024 to help you with your search. The following lists the companies we think are driving the data annotation space: Encord iMerit Appen Label Your Data KeyMakr TrainingData SuperbAI Kili Technology Telus International SuperAnnotate CogitoTech LabelBox Top 12 Data Annotation and Data Labeling Companies Data annotation companies offering labeling solutions must meet stringent security and scalability requirements to match the high standards of the modern artificial intelligence (AI) space. Below are the twelve top companies, ranked based on the following factors: Data security protocols: Compliance with data security regulations and use of encryption algorithms. Scalability: The solution’s ability to handle large data volumes and variety. Collaboration: Tools allowing different team members to collaborate on projects. Ease of use: A user-friendly interface that is intuitive and easy to navigate. Supported data types: support for different modalities such as video, image, audio, and text. Automation: AI-based labeling for speeding up annotation processes. Other functionalities for streamlining the annotation workflow include integration with cloud services and advanced annotation methods for complex scenarios. Let’s explore each company's annotation platforms or services and see the key features based on the above factors to help you determine the most suitable option. Encord Encord is an end-to-end data platform that enables you to annotate, curate, and manage computer vision datasets through AI-assisted annotation features. It also provides intuitive dashboards to view insights on key metrics, such as label quality and annotator performance, to optimize workforce efficiency and ensure you build production-ready models faster. SOTA Model-assisted Labeling and Customizable Workflows with Encord Annotate Key Features Data security: Encord complies with the General Data Protection Regulation (GDPR), System and Organization Controls 2 (SOC 2), and Health Insurance Portability and Accountability Act (HIPAA) standards. It uses advanced encryption protocols to ensure data security and privacy. Scalability: The platform allows you to upload up to 500,000 images (recommended), 100 GB in size, and 5 million labels per project. You can also upload up to 200,000 frames per video (2 hours at 30 frames per second) for each project. See more guidelines for scalability in the documentation. Collaboration: You can create workflows and assign roles to relevant team members to manage tasks at different stages. User roles include admin, team member, reviewer, and annotator. Ease-of-use: Encord Annotate offers an intuitive user interface (UI) and an SDK to label and manage annotation projects. Supported data types: The platform lets you annotate images, videos (and image sequences), DICOM, and Mammography data. Supported annotation methods: Encord supports multiple annotation methods, including classification, bounding box, keypoint, polylines, and polygons. Automated labeling: The platform speeds up the annotation with automation features, including: - Segment Anything Model (SAM) to automatically create labels around distinct features in all supported file formats. - Interpolation to auto-create instance labels by estimating where labels should be created in videos and image sequences. - Object tracking to follow entities within images based on pixel information enclosed within the label boundary. Integration: Integrate popular cloud storage platforms, such as AWS, Google Cloud, Azure, and Open Telekom Cloud OSS, to import datasets. Best for Teams looking for an enterprise-grade image and video annotation solution to produce high-quality data for computer vision models. Pricing Encord has a pay-per-user pricing model with Starter, Team, and Enterprise options. Learn more about automated data annotation by reading our guide to automated data annotation. iMerit iMerit offers Ango Hub, a data annotation solution built on a generative AI framework that lets you build use-case-specific applications for autonomous vehicles, agriculture, and healthcare industries. iMerit Key Features Collaboration: The Ango Hub solution lets you add labelers and reviewers to customized workflows for managing annotation projects. Ease-of-use: The platform offers an intuitive UI to label items, requiring no coding expertise. Supported data types: Ango Hub supports audio, image, video, DICOM, text, and markdown data types. Supported labeling methods: The solution supports bounding boxes, polygons, polylines, segmentation, and tools for natural language processing (NLP). Integration: The platform features integrated plugins for automated labeling and machine learning models for AI-assisted annotations. Best for Teams searching for an integrated labeling platform for annotating text, video, and image data. Pricing Pricing information is not publicly available. Contact the team to get a quote. Appen Appen offers data annotation solutions for building large language models (LLMs) by providing a standalone labeling platform and data labeling services through expert linguists. Appen Key Features Workforce capacity: Appen’s managed services include more than a million specialists speaking over 200 languages across 170 countries. With the option to combine its platform with its services, the solution becomes highly scalable. Supported data types: Appen’s platform lets you label documents, images, videos, audio, text, and point-cloud data. Supported annotation methods: Labeling methods include bounding boxes, cuboids, lines, points, polygons, ellipses, segmentation, and classification. Instruction datasets: The company also offers domain-specific instruction datasets for training LLMs. Best for Teams looking for a hybrid solution for building multi-modal models for text and vision applications. Pricing Pricing is not publicly available. Label Your Data Label Your Data is a data annotation service provider offering video and image annotation services for CV and NLP applications. Label Your Data Key Features Data security: The company complies with ISO 27001, GDPR, and CCPA standards. Workforce capacity: Label Your Data builds a remote team of over 500 data annotators to speed up the annotation process. Supported data types: The solution supports image, video, point-cloud, text, and audio data. Supported labeling methods: CV methods include semantic segmentation, bounding boxes, polygons, cuboids, and key points. NLP methods include named entity recognition (NER), sentiment analysis, audio transcription, and text annotation. Best for Teams looking for a secure annotation service provider for completely outsourcing their labeling efforts. Pricing Label Your Data provides on-demand, short- and long-term plans. Keymakr Keymakr is an image and video annotation service provider that manages labeling processes through its in-house professional experts. Keymakr Key Features Labeling capacity: You can label up to 100,000 data items. Supported data types: The platform supports image, video, and point-cloud data. Supported labeling methods: Keymakr offers annotations that include bounding boxes, cuboids, polygons, semantic segmentation, key points, bitmasks, and instance segmentation. Smart assignment: The solution features a smart distribution to match relevant annotators with suitable tasks based on skillset. Performance tracking: Keymakr provides performance analytics to track progress and alert managers in case of issues. Data collection and creation: The company also offers services to create relevant data for your projects or collect it from reliable sources. Best for Beginner-level teams working CV projects, requiring data creation and annotation services. Pricing Pricing is not publicly available. TrainingData TrainingData is a Software-as-a-Service (SaaS) data labeling application for CV projects, featuring pixel-level annotation tools for accurate labeling. TrainingData Key Features Data security: The company provides a Docker image to run on your local network through a secure virtual private network (VPN) connection. Scalability: You can label up to 100,000 images. Collaboration: TrainingData’s platform lets you create projects and add relevant collaborators with suitable roles, including reviewer, annotator, and admin. Supported labeling methods: The platform offers multiple labeling tools, including a brush and eraser for pixel-accurate segmentation, bounding boxes, polygons, key points, and a freehand drawer for freeform contours. Integration: TrainingData integrates with any cloud storage service that complies with cross-origin resource sharing (CORS) policy. Best for Teams looking for an on-premises image annotation platform for segmentation tasks. Pricing TrainingData offers free, pro, and enterprise packages. SuperbAI SuperbAI offers multiple products for building AI models, including a data management platform, a labeling solution, and a tool for training, evaluating, and deploying models. SuperbAI Key Features Data security: SuperbAI complies with SOC standards and encrypts all data using Advanced Encryption Standard - 256 (AES-256). Collaboration: The platform offers access management tools and lets you invite team members as admins, labelers, and managers. Supported data types: SuperbAI supports images and videos in PNG, BMP, JPG, and MP4 formats. It also supports point-cloud data. Supported labeling methods: The solution supports all standard labeling methods, including bounding boxes, polylines, polygons, and cuboids. Integration: The platform integrates with Google Cloud, Azure, AWS, and Slack. Best for Teams looking for an integrated data management solution for training machine learning algorithms. Pricing SuperbAI offers starter and enterprise packages. Kili Technology Kili Technology offers an intuitive labeling platform to annotate data for LLMs, generative AI, and CV models with quality assurance features to produce error-free datasets. Kili Technology Key features Collaboration: The platform lets you assign multiple roles to team members, including reviewer, admin, manager, and labeler, to collaborate on projects through instructions and feedback. Ease-of-use: Kili offers a user-friendly UI for managing workflows, requiring minimal code. Supported labeling methods: The tool supports bounding boxes, optical character recognition (OCR), NERs, pose estimation, and semantic segmentation. Automation: Kili supports automated labeling through active learning and pre-annotations using ChatGPT and SAM. Best for Data scientists looking for a lightweight annotation solution for building generative AI applications. Pricing Pricing depends on the number of items you need to label. Telus International Telus International’s Ground Truth (GT) studio offers three platforms as part of a managed service to build training datasets for ML models. GT Manage helps with people and project management; GT Annotate lets you annotate image and video data. GT Data is a data creation and collection tool supporting multiple data types. Telus International Key Features Data security: GT Annotate complies with SOC 2 standards and implements two-factor authentication with firewall applications and intrusion detection for data security. Collaboration: GT Manage features workforce management tools for optimal task distribution and quality control. Supported data types: You can collect image, video, audio, text, and geo-location data using GT data. Supported labeling methods: GT Annotate supports bounding boxes, cuboids, polylines, and landmarks. Best for Teams looking for a complete AI solution for collecting, labeling, and managing raw data. Pricing Pricing information is not publicly available. SuperAnnotate SuperAnnotate offers a data labeling tool that lets you manage AI data through collaboration tools and annotation workflows while providing quality assurance features to produce labeling accuracy. SuperAnnotate Key Features Collaboration: SuperAnnotate lets you create teams and assign relevant roles such as admin, annotator, and reviewer. Ease-of-use: The platform has an easy-to-use UI. Supported data types: SuperAnnotate supports image, video, text, and audio data. Supported labeling methods: The platform has tools for categorization, segmentation, pose estimation, object tracking, sentiment analysis, and speech recognition. Best for Teams looking for an annotation solution to build generative AI applications. Pricing The platform offers free, pro, and enterprise versions. Cogito Cogito is a data labeling service provider that employs a large pool of human annotators to deliver annotations for generative AI, CV, content moderation, NLP, and data processing. Cognito Key Features Data security: Cogito complies with GDPR, SOC 2, HIPAA, CCPA, and ISO 27001 standards. Supported data types: The platform supports image, video, audio, text, and point-cloud data. Automation: Cogito uses AI-based algorithms to label large data volumes. Best for Startups looking for a company to outsource their AI operations. Pricing Pricing is not publicly available. Labelbox Labelbox offers multiple products for managing AI projects. Its data labeling platform allows you to annotate various data types for building vision and LLM applications. LabelBox Key Features Data security: Labelbox complies with several regulatory standards, including GDPR, CCPA, SOC 2, and ISO 27001. Collaboration: Users can create projects and invite in-house labeling team members with relevant roles to manage the annotation workflow. Ease-of-use: Labelbox has a user-friendly interface with a customizable labeling editor. Automation: The platform supports model-assisted labeling (MAL) to import AI-based classifications for your data. Integrability: Labelbox integrates with AWS, Azure, and Google Cloud to access data repositories quickly. Best for Teams looking for labeling solutions to build applications for e-commerce, healthcare, and financial services industries. Pricing Labelbox offers free, starter, and enterprise versions. Still confused about whether to buy a tool or go for open-source solutions? Read some lessons from practitioners regarding build vs. buy decisions Data Annotation Companies: Key Takeaways CV applications are driving the current industrial landscape by innovating fields like medical imaging, robotics, retail, etc. However, CV’s rapid expansion into these domains calls for robust data annotation tools and services to build high-quality training data. Below are a few key points regarding data annotation companies in 2024. Security is key: With data privacy regulations becoming stricter globally, companies offering annotation solutions must have compliance certifications to ensure data protection. Scalability: Annotation companies should offer scalable tools to handle the ever-increasing data volume and variety. Top annotation companies in 2024: SuperAnnotate, Encord, and Kili are the top 3 companies that provide robust labeling platforms and services.
Feb 23 2024
8 M
Why AI Is the Mother of All Unicorns
"Say you want to watch a movie. To choose, you'll want to know what movies others liked and, based on what you thought of other movies you've seen if this is a movie you'd like. You'll be able to browse that information. Then you select and get video on demand. Afterward, you can even share what you thought of the movie. But thinking of it only in terms of movies on demand trivializes the ultimate impact. The way we find information and make decisions will be changed. Think about how you find people with common interests, pick a doctor, and decide what book to read. Right now, reaching out to a broad range of people is hard. You are tied into the physical community near you. But in the new environment, because of how information is stored and accessed, that community will expand. This tool will be empowering, the infrastructure will be built quickly and the impact will be broad." - The Bill Gates Interview, Playboy Magazine, July 1994 Sound familiar? Asked what else the personal computer was supposed to do other than process documents, Bill Gates prophesied the changes brought about by the coming of the information age that modern-day tech giants have since realized. From video-on-demand and movie recommendations (Netflix) to the way we find information (Google) to how you find people with common interests (Facebook) and deciding what book to read (Amazon 1.0), Gates' vision of the transformation that the information age would bring about turned out in more ways than one could conceivably imagine at the dawn of the Internet revolution. The Coming of the AI Revolution Fast forward 20 years, the AI revolution has begun. It will fundamentally transform our world, much just like the advent of the atomic bomb, microprocessor, personal computer, and the Internet. If the wealth generated from the emergence of each of these technologies offers any indication, we are poised to witness an unprecedented accumulation of wealth. As with any prophesied significant platform shift, there's a real risk that they fail to materialize in a big way at a particular moment in time (e.g. Web3, Blockchain, Crypto, Metaverse) or that they take much longer than anticipated to play out (admittedly, crypto can still find an actual use case). Until as recently as ten years ago, almost all AI systems failed to demonstrate significant value, and many still do not (e.g., purely logic-based AI systems and symbolic AI - the dominant paradigm from the 1950s to the mid-1990s - are still primarily research interests). We could be in for another AI hype cycle that may eventually fizzle. As an eternal optimist and founder of an AI company looking to raise a Series B in the not-too-distant future, I won't bother spelling out why AI is overrated. Instead, I'll argue why it will change the world. When I explain AI to my parents, I describe it as a new form of dynamic software built on answers, unlike traditional status software built on rules. Put simply, comparing AI to conventional software is like saying "show" instead of "tell." What's exciting about AI is that dynamic and answer-based software will enable us to create new products, applications, and systems that can solve unsolved problems that, until now, have been reserved for human cognition. Self-driving cars are the most obvious example - while traditional software can handle simple tasks like driving straight, building a fully autonomous vehicle would require an overwhelming number of static rules to cover even the basics of navigation. It is not a leap to believe that the total addressable market (TAM) of problems only solvable by human cognition is orders of magnitude higher than that of any of the problems for which we use traditional software. As AI can augment and - in some cases - replace humans, it can produce what I think of as "non-linear" productivity outcomes. Here are a few contrived examples across various vertical use cases to illustrate the potential non-linearity of AI systems: Building a faster car to reduce the amount of attention required to drive from A to B (linear) vs. self-driving vehicle (non-linear) More efficient organization of leads and tasks in a CRM system with a slightly better UI for salespeople (linear) vs. AI talking avatars that allow for infinite scaling of the salesperson (non-linear) Improved diagnostic equipment that provides more detailed images for radiologists to analyze (linear) vs. AI-driven systems that scan medical images and highlight potential anomalies for doctors or even predict possible illnesses before symptoms manifest based on health data (non-linear) Better tractors and machinery to help farmers plant and harvest crops (linear) vs. drones and robots that monitor the health of individual plants, apply precise amounts of fertilizer or pesticide, and harvest crops with minimal human intervention (non-linear) A digital learning and education platform with improved video lectures and homework targeted specifically at programming (linear) vs. an adaptive chatbot that can be prompted to "explain this concept like I'm 12 years old" (non-linear) The digital learning example is interesting as it is playing out in real-time: Chegg, the education technology company, saw its stock price tumble 47% (down ~63% year-to-date) after admitting that ChatGPT was pressuring its subscriber growth, leading them to suspend their full-year outlook. You get the idea. Just as the Internet's value skyrocketed with evolving applications, tools, and increased user participation, so too will the AI sector's worth. Despite the Internet's basic components remaining similar to those of the early 1990s, its value has grown exponentially over 20 years due to expanded applications and user engagement. As more individuals and businesses embrace AI and develop applications, the supporting tools and infrastructure will improve. Increased data availability will also enhance product quality. This cyclical improvement will fuel exponential growth in AI. Undoubtedly, AI is poised to be the next major technological platform shift within the next 20 years. While previous technological revolutions, like the Internet or the Industrial Revolution, were monumental in reshaping societies and economies, AI encapsulates something far more profound: the essence of human cognition. The wealth generation from novel solutions to previously unsolvable problems, combined with the heightened productivity and efficiency across all sectors, implies that the economic impact of AI could dwarf that of all prior technological shifts. The emergence of AI will give birth to an unfathomable number of unicorns. Who Wins: Titans, Challengers, or Innovators? Ok, so AI will be huge, but who wins the biggest slice of the pie, and where will the most value be generated? While the Twitter VC community may have its own predictions, here are my thoughts on a potential outcome. I could of course be entirely wrong. Early winners like NVIDIA have already experienced a surge in their stock price, and investors believe that generative AI and LLM developers will be the next big thing, as evidenced by the high valuations and significant investment flowing into those companies. The landscape is already fiercely competitive, especially among "neo" foundation model/LLM providers (e.g., Cohere, Anthropic, Mistral). Given the high valuations and evolving competitive landscape, I question the viability of venture-scale returns for most of these new entrants. There is a limit to how many chatbots the market can absorb, after all. Additionally, OpenAI is also so far ahead (8 years of R&D and billions of queries via ChatGPT generating valuable RLHF data) that it will be difficult for any of these companies to catch up. Perhaps one or two will succeed, but for emerging LLM companies to truly thrive, they will likely need to uncover unique niches or pivot towards refining larger models using specialized, proprietary datasets for distinct needs and/or partnering with downstream application developers. Some corporate/venture combinations could also happen, where partnerships like OpenAI/MSFT, Anthropic/Google, and Cohere/Meta-type will combine distribution and data advantage with R&D expertise. Elad Gil made some interesting observations on this here. Separately, foundation model providers will likely realize lower margins in the first few years of operation, as they more closely resemble 'hardware-type companies' with significant upfront training expenses. However, this does not mean that developers who create applications using these large models won't achieve considerable margins even if the TAM of those markets is much smaller, for example, by offering specialized expertise, products, and other value-added services related to these models. Jasper is an example of a company that has done this perfectly - they've built a product that serves marketers, and just marketers, well. All things considered, the AI market will probably resemble that of the current software market in 20 years. There will be a few huge "Big AI" companies with over $100 billion in revenue (this could very well end up being the foundation model developers, but it could also be companies that we haven't even conceived of yet) and a diaspora of large companies focusing on specific applications (e.g., Stripe for payments, Uber for transportation, Figma for design - this could be Jasper for marketers, Cruise for autonomous vehicles, Viz AI for medical imagery, and so on). For context, Apple, Microsoft, Amazon, Meta, and Alphabet constitute ~$9 trillion of the value of the NASDAQ's ~$22 trillion market cap. This is a substantial chunk, no doubt, but the total size of the pie is undoubtedly only going to get bigger as the AI market gets underway and secular trends in technology continue to reverberate. Why This Time is Different Technological revolutions often occur due to a convergence of pivotal factors, and the AI sector is currently experiencing such a juncture. Similar to the Internet's ascendance, which was enabled by ubiquitous personal computers and faster connectivity, the current AI boom is a product of simultaneous advances in computing power, vast data availability, and increasingly advanced models. Eric and I founded Encord at a pivotal moment when object detection models transitioned from often being erroneous and requiring highly controlled "sandbox"-type environments to delivering tangible ROI. Similarly, the release of ChatGPT marked a paradigm shift in how we approached natural language processing and understanding. Looking into the near future, I anticipate AI delving deeper into multi-modal applications, offering higher ROI and increasingly viable solutions to more complex problems, and even stepping into realms of human reasoning. In short, the market is just getting started. The value, revenue, TAM, etc., will naturally accrue as the complexity of the problems that we solve with AI increases. After all, it was impossible to stream a movie over your Internet connection 20 years ago, but now its table stakes.
Aug 24 2023
4 M
Meet Denis - Full Stack Engineer at Encord
We sat down with Denis Gavrielov, Full Stack Engineer at Encord, to learn more about his day-to-day. Denis is an essential player on the Engineering team, he has been a part of Encord's exciting journey from a small 8-person team to the dynamic startup it is today. He walks us through the highs and lows of his experiences, the camaraderie he shares with his team, and the thrill of working on some truly fascinating projects. If you've ever wondered what it's like to work at a startup like Encord, strap in and join us for this conversation! Denis, first question. What inspired you to join Encord? I applied about two years ago when the team was still getting started (we were eight people strong — all engineers!). At that time, I wasn't sure if I wanted to join a startup as I had been working in a larger company up until then (Bloomberg). I met most of the team throughout the interview process, and the more I spoke to everyone, the more I felt I'd really enjoy working with them. I also had a great first impression of the founders (Eric & Ulrik) and the more I spoke with them the more I got a sense for how strong their skillset was and how unique and unparalleled of an opportunity it'd be. The combination of team, founders, and opportunity is what ultimately led me to join Encord. I still remember my first week sitting in our 10m2 office space in the middle of Soho without any meeting rooms or kitchen, quite different from today! Very different from now indeed! What does a typical day as a full-stack engineer at Encord look like? Every day is a bit different. Some people like to focus on back-end engineering and some like front-end work — I like to work on both! Typically, I start by catching up with everyone's messages on Slack. I might have one or two meetings in the morning before starting to code. Throughout the day there's a lot of collaboration via impromptu short catch-ups in person or on Slack. During these meetings, I often discuss architectural designs and problem-solving strategies with more senior team members. A significant component of my day is spent coding and solving problems. And I guess you fit in lunch in between that as well, right? That's true, a good lunch break is important for recharging! I usually grab lunch with a few colleagues — if the weather is nice, like during the summer, we enjoy our lunch outside. On Fridays, the entire team gets together in the office for a company lunch, that's always fun and one of my favorite Encord traditions. You mentioned teamwork and customer centricity — how would you describe the company culture at Encord more broadly? The company culture is very collaborative. The founders and senior managers are encouraging and open to suggestions, so it's easy to take (good) decisions independently. We try to maintain a relaxed office culture focused on team success, where people are free to work in a manner that works best for them and for us as a team more broadly. People are always trying to help each other out and break down any silos there might arise — although we've been very intentional though about building the right environment, so luckily there are very little silos in the first place. For example, our engineering and commercial team all work in one big room and getting context or feedback is incredibly quick. Another key part of our culture is being able to adapt quickly. Previously, I was working at Bloomberg in an infrastructure team, where we had to plan everything very precisely. Everything had to be correct from the start, and every decision would need to go through many layers of approval and planning sessions. Speed of progress was naturally quite different. Here at Encord, we are a nimble, rapid team — we move quickly and have been able to achieve things that teams many times our size have struggled with. Customers tell us every week how much better our product is compared to other alternatives they're looking at. It is always very rewarding when we hear this feedback. What we aim for as a team is to be great — ship high-performing industry-leading products quickly, get feedback from customers and prospects early, and consistently focus on building what customers and users really want. I think these are the two principles that stand out to me as an engineer the most. Can you tell us a bit about a project that you're currently working on? One of the projects I'm working on is overhauling of our task management system. The core problem we're trying to solve is that our customers need a detailed overview and a flexible interface to control the annotation and review process for their data operations. The process can be complicated, involving multiple review stages and different levels of scrutiny. So we're developing a system to allow our clients to implement more complex workflows and be able to review the process even faster! In terms of how I spend my time between collaborating and individual development, it varies heavily by what point of the project we're in. At the beginning, a large part of my time is spent brainstorming, talking through the objectives with other engineers and product managers, and listening in on client calls. After gathering ideas and refining our approach, I spend most of my time coding and focusing on the project — while still staying in sync with the rest of the team so that we keep moving quickly in the same direction. Lastly, what advice would you give to someone considering working at Encord? Many people reach out to me each week asking this and there's a few things I think many people don't consider. Encord is obviously in a very exciting position — we're a strong team, in the right market (AI infrastructure), and have a clear vision of what the future will be like and what we need to build to get there. Yet early-stage startups, especially ones moving very quickly, require working-modes and dispositions that the vast majority of people are not looking for nor comfortable with — and that's okay! If you don’t particularly enjoy collaborating on tasks, or you prefer being told very precisely/prescriptively what to do, or you're not at the stage in your career where you want to own processes, then I'd say Encord (and similar-type companies) are probably not the best fit. It really depends on what excites you. Here, you need to embrace pace, ownership, collaboration, and autonomy, often to degrees you may not have considered possible. These are key traits that, I think, have all made us successful in our roles. If you want to own problem areas and find solutions, if you like collaborating and prototyping MVPs quickly to get client feedback early, and if you pride yourself in making the right decisions with limited amounts of information — then Encord may be a good fit! Thank you, Denis! We have big plans for 2023 & are hiring across all teams. Find here the roles we are hiring for!
Aug 22 2023
5 M
Meet Rad - Head of Engineering at Encord
At Encord, we believe in empowering employees to shape their own careers. The company fosters a culture of ‘trust and autonomy’, which encourages people to think creatively and outside the box when approaching challenges. We strongly believe that people are the pillars of the company. With employees from over 20 nationalities, we are committed to building a culture that supports and celebrates diversity. For us, we want our people to be their authentic self at work and be driven to take Encord's mission forward. Rad Ploshtakov was the first employee at Encord and is a testament to how quickly you can progress in a startup. He joined as a Founding Engineer after working as a Software Engineer in the finance industry, and is now our Head of Engineering. Hi Rad! Tell us about yourself, how you ended up in Encord, and what you’re doing. I was born and raised in Bulgaria. I moved to the UK to study a masters in Computing (Artificial Intelligence and Machine Learning) at Imperial College London. I am also a former competitive mathematician and worked in trading as a developer, building systems that operate in single digit microseconds. Then I joined Encord (or Cord, which is how we were known at the time!) as the first hire - I thought the space was really exciting, and Eric and Ulrik are an exceptional duo. I started off as a Founding Engineer and, as our team grew, transitioned to Head of Engineering about a year later. I am responsible for ensuring that as an engineering team we're working on what matters most for our current and future customers - I work closely with everyone to set the overall direction and incorporate values for the team. Nowadays, a lot of my time is also spent on hiring, and on helping build and maintain an environment in which everyone can do their best work. What does a normal day at work look like for you? Working in a startup means no two days are the same! Generally, I would say that my day revolves around translating the co-founders' goals into actionable items for our team to work on - communicating and providing guidance are two important aspects of my role. A typical day includes meeting with customers and prospects, code reviewing, and supporting across different initiatives. Another big part is collaborating with other teams to understand what we want to build and how we are going to build it. Can you tell us a bit about a project that you are currently working on? Broadly speaking, a lot of my last few weeks has been supporting our teams as they set out and execute on their roadmaps. 2023 will be a huge year for us at Encord, and we're moving at a very fast pace, so a lot of my focus recently has been helping us be set up for success. As for specific projects, I'm very excited about all the work our team is doing for our customers. For example, our DICOM annotation tool has recently been named the leading medical imaging annotation tool on the market - which is a huge testament to the work our team has poured into it over the last year. I remember hacking together a first version of our DICOM annotation tool in my first (admittedly mostly sleepless!) weeks at Encord, and seeing how far it's come in just a few months has been one of the most rewarding parts of my last year. What stood out to you about the Encord team when you joined? Many things. When I first met the co-founders (Eric & Ulrik), I was impressed by their unique insights into the challenges that lay ahead for computer vision teams - they can simultaneously visualize strikingly clearly what the next decade will look like, while also being able to execute at mind-boggling speed in the moment, in that direction. I was impressed also by how smart, resourceful and driven they were. By the time I joined, they had been able to build a revenue generating business with dozens of customers - getting to understand deeply the problems that teams were facing and then iterating quickly to build solutions that not even they had thought about. What is it like to work at Encord now? It's a very exciting time to be at Encord. Our customer base has been scaling rapidly, and the feedback loop on the engineering cycle is very short, so we get to see the impact of our work at a very quick pace which is exciting - often going from building specs for a feature, to shipping it, showing it to our customers, and seeing them starting to use it all happen in a span of just a few weeks. A big part of working at Encord is focusing a lot on our customer's success - we always seek out feedback, listen, and apply first principles to the challenges our customers are facing (as well as getting ahead with ones we know they'll be facing soon that they might not be thinking about yet!). Then work on making the product better and better each day. How would you describe the team at Encord now? The best at what they do - also hardworking, very collaborative and always helping and motivating each other. One of our core values is having a growth mentality, and each member of our team has come into the company and built things from the ground up. Everyone has a willingness to roll up their sleeves and make things happen to grow the company. A resulting factor of this is also that it's okay to make mistakes - we are constantly iterating and trying to get 1% better each day. We have big plans for 2023 & are hiring across all teams! ➡️ Click here to see our open positions
Mar 09 2023
3 M
How to Review Encord on G2
One of the most popular sites for researching business software is G2. It features reviews of any kind of software you can think of, allowing people to read reviews from their peers and get a better understanding of what the best software solution is for them. This also applies to the world of computer vision and machine learning. And for us at Encord, we really want the computer vision community to know how good Encord is. That’s why we’re asking Encord users (whether you’re involved in annotation, data operations or machine learning) to head over to G2 and leave us a review. Not only does it help us get in front of a wider group of people, but it also helps the machine learning and computer vision community find out about Encord. Leaving a review doesn’t take long - follow the steps below and you’ll have your review up in no time! Step 1: Registration Follow this link and then click on the blue button that says ‘Continue to Login’ You’ll be asked to sign in. If you’ve got a G2 account, use that. Otherwise, click on ‘Business Email’ or LinkedIn to create your account On the next screen enter your first name, last name, choose a password and press ‘Create an Account’. You’ll be sent to a page where you need to enter a verification code. This code will be sent to your email address. You need to go into your email, find the email with this code and then come back to this page to enter the code. Once you’re in G2 you’ll need to set up your profile. Then you’ll be able to carry on submitting your review (if you need to come back to it, use this link to start the review process again) Step 2. Submitting Your Review Rate Encord on a scale of 1 - 10. These reviews are really important to us, so if you don’t feel we rate a 9 or a 10, please reach out to us first so we can address whatever problems you’re experiencing on the platform. Then add a short title for your review and fill out the sections ‘what do you like best’ and ‘what do you dislike’. (Take a look at these positive reviews for inspiration). You only need to enter 40 characters, but more is definitely better! And if you have any major dislikes, please speak to us first - we might be able to help or show you a feature you’ve missed Select your primary role when using Encord Once you’ve filled out these sections the button in the bottom right will turn green. Click on this. You’ll then see the following screen The more categories you select for the ‘For which purposes do you use Encord’ question, the more questions you’ll have to answer. So if you’ve only got time for one, please select ‘data labeling’. For ‘What problems is Encord solving…’ you need to enter 160 characters - about two sentences. For the next questions, if you don’t think we’re headed in the right direction or if you don’t think we deserve a 6 or 7, please talk to us. As with the other questions, we can work together to address whatever issue you’re having with ease of use or support. Then click on the button in the bottom right once it turns green On the next page, there’s a couple of obligatory questions (with the red asterisk). The most complicated one is ‘Are you a current user of Encord?’. Your answer to this should be yes. You’ll then need to show that you’re a current user of Encord by uploading a screenshot of your account in Encord. For the screenshot, you need to open up a new window in your browser and go to https://app.encord.com/settings (it may prompt you to login to the platform). It should look like this (in dark mode) Then take a screenshot of this screen. There’s two ways to do this depending on whether you’re using a PC or Mac. PC INSTRUCTIONS 1. Go to the Start menu, search for ‘Snipping Tool’ and click on the result 2. Once you’re in the Snipping Tool press click the ‘new’ button in the top left 3. Draw around the whole of the ‘Settings’ screen in Encord (see example screenshot below) 4. Then click the three dots in the top right corner of the snipping tool and save the snip to your desktop (or anywhere else you can find it) MAC INSTRUCTIONS 1. Press and hold these three keys together: Shift, Command, and 4. 2. Drag the crosshair to select the area of the screen to capture and let go 3. The image will be saved to your Mac’s desktop 4. To upload the screenshot you’ve taken click on the blue button that says ‘Upload Screenshot’ and find the screenshot on your computer Once you’ve uploaded your screenshot, you can answer the remaining optional questions (there’s one obligatory question at the bottom). Once you’ve answered all the obligatory questions and uploaded your screenshot, the button in the bottom right will turn green. On the next screen, there’s a couple of obligatory questions, and the rest are optional. Company name is one obligatory question. Please make sure you enter the name of the company you’re using Encord in the question ‘At which organization did you most recently use Encord?’ and then put the company’s website address in the question below so that G2 can find your company. The next obligatory question is ‘What is your industry when using this product?’. Just start typing in here and you’ll see a list of industries that you can select from. There’s three obligatory questions at the bottom of the list - they’re self-explanatory but need to be answered before you can submit your review. Once all the required questions have been answered, you’ll see two buttons appear in the bottom right. You don’t need to click on the green one, but can click on the white one that says ‘Submit My Review’. That’s it, you’re done! We really appreciate you taking the time to review Encord to tell the rest of the world how great the platform is. Even one positive review really helps us, so from the whole Encord team, thank you!
Feb 14 2023
5 M
Closing the AI Production Gap With Encord Active
Recently there was a tweet by Paul Graham showing an old cover from Maclean’s magazine highlighting the “future of the internet”. Graham poses the accompanying question: “What do 36 million people use now that eventually 5 billion people will?” At Encord we have long believed the answer to this question to be artificial intelligence, as AI feels to be at a similar inflection point now as the internet was in the 90s, poised to take off for widespread adoption. While this view that AI will be the next ubiquitous technology is not an uncommon one, its plausibility hasn’t been as palpable as recently with the imagination-grabbing advancements in generative AI over the last year. These advancements have seen the rise of “foundational models”, high capacity unsupervised AI systems that train over enormous swaths of data and take millions of dollars of GPU power doing it. TLDR; Problem: There is an “AI production gap” between proof-of-concept and production models due to issues with AI model robustness, reliability, explainability, caused by a lack of high-quality labels, model edge cases, and cumbersome iteration cycles for model re-training. Solution: We are releasing a free open-source active learning toolkit, designed to help people building computer vision models improve their data, labels, and model performance. Introduction The success of foundational models is creating a dynamic of duality in the AI world. With foundational models built by well-funded institutions with the GPU muscle to train over an internet’s worth of data and application-layer AI models normally built from the traditional supervised learning paradigm requiring labeled training data. While the feats of these foundational models have been quite impressive, it is quite clear we are still in the very early days of the proliferation and value of application-layer AI, with numerous bottlenecks holding back wider adoption. We started Encord a few years ago originally to tackle one of the major blockers to this adoption, the data labeling problem. Over the years, working with many exciting AI companies, we have since enhanced our views on the blockers for later-stage AI development and deploying models to production environments. Over this post, we will discuss our learnings from thousands of conversations with ML engineers, data operations specialists, business stakeholders, researchers, and software engineers and how it has culminated in the release of our newest open-source toolkit, Encord Active. The Problem: Elon’s promise There is a famous Youtube sequence of Elon Musk promising Tesla’s delivery of self-driving cars, every “next year”, since 2014. We are now at the start of 2023, and that promise still seems “one year” away. This demonstrates how the broader dream of realizing production AI models that have transformative effects over the real world (self-driving cars, robot chefs, etc) has been slower to materialize than expected given early promise and floods of investment. This is a multi-faceted and complex issue coupled with societal structures outside the tech industry (regulators, governments, industry, etc.) lagging appreciation of associated implications that come from the second-order effects of adopting this technology. The more pernicious problem, however, is one which lies within the technology itself. Promising proof-of-concept models which perform well on benchmarked datasets in research environments have often struggled when in contact with real-world applications. This is the infamous AI production gap. Where does this gap come from? The AI Production Gap One of the main issues is that the set of requirements asked of AI applications rises precipitously when in contact with a production environment: robustness, reliability, explainability, maintainability, and much more stringent performance thresholds. An AI barista making a coffee is impressive in a cherry-picked demo video, but frustrating when spilling your Frappuccino 5 times in a row. As such, the gap between “that’s neat” and “that’s useful” is much larger and more formidable than ML engineers had anticipated. The production gap can be attributed to a few high-level sub-components. Among others: Slow shift to data-centricity: Working with many AI practitioners, we have noticed a significant disconnect between academics and industry. Academics often focus on model improvements, working with fixed benchmark datasets and labels. They optimize the parts of the system that they have the most control over. Unfortunately, in practical use cases, these interventions have lower leveraged effects on the success of the AI application than taking a data-centric view. Insufficient care has been placed on data-centric problems such as data selection, data quality improvement, and label error reduction. While not important from a model-centric view with fixed training and validation datasets, these elements are crucial for the success of production models. Lack of decomposability: A disadvantage of deep learning methods compared to traditional software is the lack of being able to take it apart in pieces for examination. Normal (but highly complex) software systems have composable parts that can be examined and tested in independent ways. Stress testing individual components of a system is a powerful strategy for fortifying the entirety. Benefits include the interpretability of system behavior and the ability to quickly isolate and debug errant pieces. Deep neural networks, for all their benefits, are billion parameter meshes of intransparency; you take it as is and have little luck in inspecting and isolating pieces component-wise. Insufficient evaluation criteria: Exacerbating the lack of decomposability are the insufficient criteria we have to evaluate AI systems. Normal approaches just take global averages of a handful of metrics. Complex high-dimensional systems need sophisticated evaluation systems to meet the complexity of their intended domain. The tools to probe and measure performance are still nascent for models and almost completely non-existent for data and label quality, leaving a lack of visibility into the true quality of an AI system. The above problems (and the lack of human-friendly tools to deal with them) have all contributed in their own way (again among others) to the AI production gap. At Encord, we have been lucky to see how ML engineers across a multifaceted set of use cases have tackled these issues. The interesting observation was that they used very similar strategies even in very varied use cases. We have been helping these companies now for years, and based on that experience we have released Encord Active, a tool that is data-centric, decomposable, human-interaction focused, and improves evaluation. How It Should Be Done Before going into Encord Active, let’s go over how we’ve seen it done by the best AI companies. The gold standard of active learning are stacks that are fully iterative pipelines where every component is run with respect to optimizing the performance of the downstream model: data selection, annotation, review, training, and validation are done with an integrated logic rather than as disconnected units. Counterintuitively, the best systems also have the most human interaction. They fully embrace the human-in-the-loop nature of iterative model improvement by opening up entry points for human supervision within each sub-process while also maintaining optionality for completely automated flows when things are working. The best stacks are thus iterative, granular, inspectable, automatable, and coherent. Last year, Andrej Karpathy presented Tesla’s Data Engine as their solution to bridge the gap, but where does that leave other start-ups and AI companies without the resources to build expensive in-house tooling? Source: Tesla 2022 AI Day Introducing Encord Active Encountering the above problems and seeing the systems of more sophisticated players led us through a long winding path of creating various tools for our customers. We have decided to release them open source as Encord Active. Loosely speaking Encord Active is an active learning toolkit with visualizations, workflows, and, importantly, a library of what we call “quality metrics”. While not the only value-add of the toolkit, for the remainder of the post, we will focus on the quality metric library as it is one of our key contributions. Quality Metrics Quality metrics are additional parametrizations added onto your data, labels, and models; they are ways of indexing your data, labels, and models in semantically interesting and relevant ways. They come in three flavors: It was also very important that Encord Active gives practical and actionable workflows for ML engineers, data scientists, and data operations people. We did not want to build an insight-generating mechanism, we wanted to build a tool that could act as the command center for closing the full loop on concrete problems practitioners were encountering in their day-to-day model development cycle. The way it works Encord Active (EA) is designed to compute, store, inspect, manipulate, and utilize quality metrics for a wide array of functionality. It hosts a library of these quality metrics, and importantly allows you to customize by writing your own “quality metric functions” to calculate/compute QMs across your dataset. Upload data, labels, and/or model predictions and it will automatically compute quality metrics across the library. These metrics are then returned in visualizations with the additional ability to incorporate them into programmatic workflows. We have adopted a dual approach such that you can interact with the metrics via a UI with visualizations, but also set them up in scripts for automated processes in your AI stack. With this approach, let’s return back to the problems we had listed earlier: Slow shift to data-centricity: EA is designed to help improve model performance among several different dimensions. The data-centric approaches it facilitates include, among others: Selecting the right data to use data labeling, model training, and validation Reducing label error rates and label inconsistencies Evaluating model performance with respect to different subsets within your data EA is designed to be useful across the entire model development cycle. The quality metric approach covers everything from prioritizing data during data collecting, to debugging your labels, to evaluating your models. The best demonstrations are with examples in the next section. Decomposability: Until we have better tools to inspect the inner workings of neural networks, EA treats the decomposability problem by shifting decomposability both up and down the AI stack. Rather than factoring a model itself, quality metrics allow you to very granularly decompose your data, labels, and model performance. This kind of factorization is critical for identifying potential problems and then properly debugging them. Insufficient evaluation criteria: As a corollary to the above EA allows for arbitrarily many and arbitrarily complex quality metrics to evaluate the performance of your model. Importantly, it breaks down the model performance as a function of the quality metrics automatically, guiding users to the metrics that are likely to be most impactful for model improvement. Until we have both AGI AND the AI alignment problem solved, it remains critically important of keeping humans in the loop for monitoring, improvement, development, and maintenance of AI systems. EA is designed with this in mind. The UI allows for quick visual inspection and tagging of data, while the programmatic interface allows for systematization and automation of workflows discovered by ML engineers and data operations people. Choose Your Own Adventure: Example Use Cases Data selection: With EA you can run your previous model over a new dataset and set the inverse confidence score of the model as a quality metric. Sample the data weighted by this quality metric for entry into an annotation queue. Furthermore, you can use the pre-computed quality metrics to identify subsets of outliers to exclude before training or subsets to oversample. You see a tutorial on data selection using the TACO dataset here. Label error improvement: You can use the “annotation quality” metric, which calculates the deviation of the class of a label from its nearest neighbors in an embedding space to identify which labels potentially contain errors. This additionally breaks down label error with respect to who annotated it, to help find annotators that need more training. If you upload your model predictions you can find high-confidence false positive predictions to identify label errors or missing labels. Model performance analysis: EA automatically breaks down your global model performance metrics and correlates them to each quality metric. This surfaces which quality metrics are important drives in your model performance, and which potential subsets of the data your model is likely to perform worst in going forward Why Open Source There was an observation we made working with late-stage AI companies that prompted us to release Encord Active open source. Many of the metrics companies use are often common, even for completely different vertical applications. One of the strategies of a startup is to reduce the amount of redundant work that is being done in the world. Before Git’s common adoption, every software company was developing its own internal version control software. This amounted to tens of thousands of hours of wasted developer time that could be allocated to more productive activity. We believe the same is being done now with quality metrics for ML engineers. Open sourcing Encord Active will remove the pain of people using notebooks to create redundant code and one-off scripts that many others are also developing and free up time for ML engineers to focus on improving their data and models in more interesting ways. As a new open source tool, please be patient with us. We have some of the leading AI companies in the world using Encord Active, but it is still very much a work in progress. We want to make it the best tool it can be, and we want it out in the world so that it can help as many AI companies as possible move forward. If it works, we can in a small way contribute to one of the hardest things in the world: making an Elon Musk promise come true. Because AI delayed is not AI denied. Want to test your own data and models? “I want to start annotating” - Get a free trial of Encord here. "I want to get started right away" - You can find Encord Active on Github here or try the quickstart Python command from our documentation. "Can you show me an example first?" - Check out this Colab Notebook. If you want to support the project you can help us out by giving a Star on GitHub ⭐ Want to stay updated? Follow us on Twitter and LinkedIn for more content on computer vision, training data, and active learning. Join the Slack community to chat and connect.
Jan 23 2023
5 M
Tech Startups are not Just About Tech: Relationships Matter too
Introduction People often think that a technology company consists of…technology. Every day, subject-matter experts work together to build cutting-edge tech that will change lives and put their businesses ahead of the competition. Build great tech, and you will build a great tech company - easy and simple. This is, however, only a small part of the story. What often gets lost is that a technology company (or any company for that matter) consists of a set of relationships. Maintaining and servicing these relationships on a regular basis is just as (or even more) important as maintaining and servicing the technology you build. The relationships between the people who build the technology, the people who build the business, and the people who invest in the company are critical to whether the company succeeds or fails. During Y Combinator, I spent a lot of time “doing things that don’t scale.” As a founder, you’re hands-on in everything, including daily problem solving for continuously evolving problems. After YC, however, when we began scaling Encord, I learned (partly because a mentor explained it to me) that as a founder, I needed to focus mostly on just two things. One, ensuring that the business has enough runway – whether from fundraising or revenue or both – to keep going, and two, finding and retaining great people. Ensuring a continuous inflow of funds and great people are two tasks that actually have a lot in common. Specifically, they both depend on building and maintaining relationships. So the question then becomes, ‘how do you build relationships?’ Remember you are in for the long haul A lot of founders think that when an investor sends over a funding check, that’s the end of the relationship. In reality, that check marked the start of the relationship, and now it’s time to maintain and further cultivate it. Investors want to help. They want your company to be successful. Don’t take them for granted. Instead, make sure you keep them in the loop and hold yourself accountable to them by doing your best work and remaining open to feedback. If you maintain and nurture your investor relationships, those firms are likely to help you in future funding rounds. While investor relationships are important, ultimately, a company should gain its funds from customers and not VCs. Buyers are the key to success, so you need to make sure your business solves a real problem, makes people happy, and creates a product or service that people want. A philosophy we learned to follow during YC and continue to follow now is to “bear-hug” your customers. If you have a product or service that people want, and you bear hug your customers, your business will grow and generate revenue. The parable of the elephant applies to your business When building the strategy for a business, you need to stay close to the end users. To understand what your customers need and where your business fits within the larger ecosystem, you need to develop relationships with a variety of stakeholders. Hypotheses for solving customer problems don’t come from reading academic papers or journals but from talking to people. We talked to more than a thousand stakeholders, and we built our business thesis from what we learned in those conversations. Predicting the distant future is difficult, so we think about how to combine more immediate customer needs with the next logical technological steps forward. Our data quality assessment tool is a great example of this synthesis, because we heard the same problem from many stakeholders, and we realised that we could build on our existing technology to solve that problem. We always continue to talk to customers, prospective customers, AI practitioners, academics, big tech companies, corporates, researchers, clinicians, and other medical professionals. We talk to a diverse array of people because each type of person has a different view of Encord and the types of problems our technology can solve. Much like the parable of the blind men and the elephant, different stakeholders focus on different parts of our product. One touches the tusk, while another touches the trunk. These groups might only know their part of the structure, and it's our job to combine their varying perspectives and subjective experiences to create a coherent structure and vision for the technology that drives the business forward. We always have to keep our eyes on the whole elephant. Relationships are critical to finding and retaining talent Without a doubt, having great people on your team is the most important factor in making the company successful. Unfortunately, there’s no shortcut or industry secret for hiring great people. It’s a grind. As an early-stage startup, don’t expect good people to just come to you. You’ve got to invest the time and resources into finding great people from the start. Search for people, and refine your judgment and intuition about the type of people who will thrive in your company. Learn from the mistakes you make. Remember that the hiring process is not a one-way street. Yes, you interview people, but they also interview you. You need to sell the company and its vision. Generating excitement about the work your company is doing is important to attract people who believe in its mission. You can’t sit on your laurels and think that the most talented people will choose your company over another offer. Once you’ve hired people, share the context with new employees so they can solve problems on their own, but don’t micromanage them. Our philosophy is if you’ve hired people you have to micromanage, then you’ve hired the wrong people. You should be happy to let new employees solve problems autonomously. Take steps to build relationships with each employee and ensure that they understand their importance within an early stage company. The odds are always against you as a startup, so just having talented people is often not enough. You also need them to be bought in. These earliest hires need to know and understand that they have a great impact on the trajectory of the company. That’s the best way to get the best from your employees. Your problems are not unique. Leverage the experience of your network. Remember, your company is not a unique snowflake within the larger business world. Many founders, including myself, we begin to think that we’re on our own, struggling through a sea of never-before-encountered problems. In reality, company leaders out there have already encountered and endured every issue you are facing. In fact, some leaders have encountered and endured them several times. In all likelihood, 80 percent of the issues you’re facing are well-trodden, while only 20 percent are company-specific. If you keep up the connections that you have, you’ll be able to lean upon a vast amount of industry expertise and business knowledge to tackle that 80 percent. Stay in contact with your mentors and your friends. At Encord, we still talk to our YC group partners. They're extremely busy, but they are an incredibly useful resource. Having friends and mentors who have experienced the things that you're experiencing and have already solved the problems that you’re trying to solve is immensely valuable, so embrace your network. When you encounter a problem, ask if anyone else has run into the same one. You’ll find that your network is a hive for answers that can save you time and money. Ready to automate and improve the quality of your data labeling? Sign-up for an Encord Free Trial: The Active Learning Platform for Computer Vision, used by the world’s leading computer vision teams. AI-assisted labeling, model training & diagnostics, find & fix dataset errors and biases, all in one collaborative active learning platform, to get to production AI faster. Try Encord for Free Today. Want to stay updated? Follow us on Twitter and LinkedIn for more content on computer vision, training data, and active learning. Join our Discord channel to chat and connect.
Nov 11 2022
5 M
Encord: The SOC 2 Compliant Platform for Computer Vision
As a proponent of data-centric AI and a leader in computer vision infrastructure and labeling software, we’re well aware of the potential pitfalls and challenges companies face during the machine learning application lifecycle. Anyone looking to deploy machine learning applications in non-trivial cases has to successfully navigate the various regulations and laws surrounding data management and where models trained on that data can be deployed — you can find a fuller set of considerations in our blog. In other cases, highly proprietary or sensitive data may be used in training and validation. In all cases, you need to know your partner in annotation, and quality control takes security and compliance with regulatory frameworks as seriously as you do. To that end, Encord is excited to announce we’ve successfully completed our SOC2 Type 1 examination as of July 8, 2022. This report affirms our commitment to the highest security and operational standards and our further commitment to communicate this with our users and customers. We’ll explain how this report illustrates our strong security posture and how it makes it easier for us to work with anyone deploying machine learning applications using our computer vision data platform. Contents of the SOC2 report This report is a confirmation that Encord is committed to the highest possible industry standards from both a system and service perspective according to the Systems and Organisational Control (SOC) criteria in Security, Availability, and Confidentiality. The report was authored by a rigorous, independent auditor and explains in detail our: - Secure architecture and policies for limiting access to data — The production infrastructure at Encord is isolated from all other networks at the company, and access is limited to only those who need it in their professional capacity. - Continuous monitoring of security controls — We use tools to help us continuously monitor system configurations and access settings to ensure we’re always maintaining a strict and compliant security posture. - Continuous monitoring of systems and incidence response policy — From the perspectives of both security and availability, Encord uses best practices, tools, and systems to ensure we’re informed of any abnormalities and positioned to investigate and respond appropriately quickly. - Thorough Employee Onboarding Process — We have a stringent process for employee interview, selection, and onboarding, including confirming candidates’ backgrounds and security training for onboarded employees, ensuring high integrity and security consciousness in the entire company. Why is SOC2 important? We’re a young startup — but we’re acutely aware of how important security and regulatory compliance are to the success and well-being of our customers and potential customers. We’re announcing SOC 2 Type 1 now to show we’re serious about security and compliance and to make it easier for our customers to understand our security posture and take comfort knowing we’re committed to best practices. Additionally, we view the SOC 2 Type 1 report as the first step in our continuous journey to enhance our security and operational excellence. As we move towards SOC 2 Type 2 compliance, adhering to the framework will ensure we have security; knowledge of regulatory frameworks plays a continuous role in growth going forward. To see more information about how we’re addressing the needs of our customers when it comes to security, take a look at our security page. If you’re a customer or potential customer, reach out to us about obtaining our full SOC 2 Type 1 report and to discuss how we can help meet your training and labeling needs while maintaining your own security and compliance. Ready to automate and improve the quality of your data labeling? Sign-up for an Encord Free Trial: The Active Learning Platform for Computer Vision, used by the world’s leading computer vision teams. AI-assisted labeling, model training & diagnostics, find & fix dataset errors and biases, all in one collaborative active learning platform, to get to production AI faster. Try Encord for Free Today. Want to stay updated? Follow us on Twitter and LinkedIn for more content on computer vision, training data, and active learning. Join our Discord channel to chat and connect.
Nov 11 2022
4 M
Providing Computer Vision Infrastructure for Project Stormcloud
Last month, Encord was one of a number of global tech companies invited by Amazon Web Services (AWS) to attend the event dubbed Project Stormcloud. In launching Project Stormcloud, The Royal Navy’s Office of the Chief Technology Officer challenged global technology giants Microsoft and AWS to demonstrate how companies could bring new, state-of-the-art cloud-based technology into the defence industry. As part of the Stormcloud Community, we’ve been supporting the Royal Navy and British Defence by providing critical computer vision infrastructure for the project, enabling the defence industry to automate visual tasks, annotate data for internal intelligence analysis, and store data at a large scale. This support allows for the application of AI for an instant real-time, on-the-ground intelligence picture. Being involved in Project Stormcloud has been a great experience for us as a company. It has been a privilege to be part of this consortium of innovators. We got to work and integrate with some of the leading tech companies in the government sector. The fact that the UK tech ecosystem could achieve so much in such a short period of time really speaks volumes about its quality. We also gained a lot of insight into the importance of using the right data to achieve specific mission objectives. It was useful to learn how our applications can be used to achieve real-time situation awareness. Stormcloud, with AWS, Microsoft, and their range of partners, will progress further over the next year to incorporate ideas from across Defence and to demonstrate how two of the leading global tech companies can revolutionize how to get technology into the hands of sailors and Royal Marines. We look forward to continuing to be part of the journey. Ready to automate and improve the quality of your data labeling? Sign-up for an Encord Free Trial: The Active Learning Platform for Computer Vision, used by the world’s leading computer vision teams. AI-assisted labeling, model training & diagnostics, find & fix dataset errors and biases, all in one collaborative active learning platform, to get to production AI faster. Try Encord for Free Today. Want to stay updated? Follow us on Twitter and LinkedIn for more content on computer vision, training data, and active learning.
Nov 11 2022
2 M
How We Raised a $4.5M Seed Round
My co-founder Eric Landau and I recently closed a $4.5M seed round for our company Encord, focused on building software to automate data annotation for computer vision, led by CRV and including the Y Combinator Continuity Fund, WndrCo, Crane Venture Partners, and the Harvard Management Company. I have never been a skilful storyteller, fairy tales being my least preferred form of prose. This may come as a surprise since I am Danish and one of the most famous writers to come out of Denmark is Hans Christian Andersen of Little Mermaid fame. However, many composers of fairy tales are venture capital investors, and I am a tech founder. I knew I would have to up the ante as my co-founder and I embarked on our maiden fundraising journey during the depths of England’s third national lockdown. Our seed fundraise marked the culmination of more than a year of uncertainty, hard work, and an infinite amount of hustle; I had to make sure our story inspired the master storytellers who heard our pitch. Not my strong suit… Blinded by Venture Dollars We first started thinking about raising money not long after we decided to start our company. Blinded by what seemed to be a constant stream of enormous venture financings in the early days of 2020 — back when COVID-19 press coverage was still almost exclusively concentrated on the Wuhan lockdown — we thought we’d have an easy time convincing investors to give us money to get our startup off the ground. Then COVID hit, which at the time seemed to mark the beginning of the end of free-flowing venture dollars. Knowing that we’d fail to gain any traction with investors preoccupied with ensuring that their portfolio companies had sufficient liquidity, we were forced to rethink our fundraising plans. With the wisdom of hindsight, it is clear that we would have failed in any environment. Pre-revenue seed financings are becoming increasingly scarce — two-thirds of companies closing a seed round in 2019 were generating revenue, up from just 9% in 2010. We hypothesised that the collapse in venture funding would be transitory, and that if we could make it ‘over the hump’ we’d be in a good spot, as fewer startups would be raising money by virtue of not having made it through the pandemic. Whether or not that turned out to be true is anyone’s guess, but it was the best thing that could have happened to us. Instead of wasting hours on end crafting pitch decks, cold emailing investors, and theorising about product-market fit, we started building a business. Fast forward a couple of months, we were fortunate enough to be accepted into Y Combinator’s W21 batch. COVID-19 slashed the number of VC rounds in the U.S. by 44 percent The Art of the Pitch My co-founder Eric Landau chronicled our YC experience in the heat of battle, so I won’t dive into that here. Instead, I’ll focus on our fundraising experience which took place around the W21 Demo Day. Investors started reaching out to us as soon as we had publicly launched on the YC company directory, and we had to try hard to resist and fend off the very creative ways angels and funds tried to get us to pitch them (indeed, for that brief period of time it felt like the tables were turned in our favour). We knew that we had to master our pitch first, utilizing tools like a portfolio visualizer to strategically present our projected growth and market capture potential to investors. Going to back to my fairy tale analogy, I want to dwell a bit on what a pitch really is, and what might constitute a good one: From first principles, a pitch is a story about the future. We told a story about how Encord would become a multi-$billion company. Things like traction, product, and a well-defined vision are all reflections of how we might come to realise that story. The better the evidence, the better the story — or, when you are a seed stage startup, most likely a fairy tale — tends to be. After months of obsessing about growth and product-market fit, we had made substantial progress on our key metric (monthly recurring revenue, or MRR for short) and, with the help of our YC group partners, felt like we had a story to match Jack and the Beanstalk. If you are reading this as an investor and have ever wondered what it’s like being on the receiving side of demo day, here is an example of what my inbox looked like 5 minutes after our pitch: I had never received 77 emails in any 5-minute period in my life prior to Demo Day Our First Fundraising Lessons Being a London-based startup, we had the good fortune of having access to both the US and European venture ecosystems. Pitches were done in random order across the two, and we pitched to more investors from the US than Europe. Here’s how we did: We were introduced to about 145 investors (angels, angel funds, and other funds of various sizes) through Y Combinator, personal networks, and other inbound We met with 52 investors over a total of 70+ meetings 18 of those investors were European; 34 were American Americans liked us more — our ‘yes’-rate from American investors was almost double that of European investors The ‘ghosting’ rate was about equal — a little over half of both Americans and Europeans didn’t say ‘yes’ or ‘no’ Getting to the second meeting increased our chances of a yes by 60% compared to our baseline On average, a European ‘yes’ required 1 additional meeting compared to a US ‘yes’ A few things stood out to us as we were compiling these statistics. The first is the startling disparity between our success rate with US compared to European investors. The second is the lack of committed decisions — just a little over half of the investors we spent time with even bothered to get back to us (writing this made me do a few searches on ‘VC ghosting’, and I stumbled on a a thread worth a read on Hacker News). When a reply is the low bar an investor has to meet, helpful feedback stands out. We would recommend the few investors who shared detailed comments with us to any founder, even though they said ‘no’ to us. Zooming in on our US/Europe success rate, a number of factors might explain the disparity: Y Combinator is better known in the US; more of the US investor leads came from inbound rather than outbound; we ourselves were biased in prioritising a speedier process and ‘doing a better job’ with US investors because we knew they could write larger checks; and/or the sample size is too small to infer anything at all. Competition versus Big Markets It is also worth touching on the types of conversations we had with investors from both sides of the pond. These are generalisations, of course, but nonetheless patterns that we recognised. Broadly speaking, US investors cared more about product, go-to-market strategy, and the absolute size of whatever market you are targeting. European investors cared more about the competitive landscape, and differentiation. I personally think that the European obsession with competition and differentiation is why our ‘yes’-rate from the Americans was higher. They generally agreed that the market for AI training data creation and management — still an emerging and unsophisticated software stack— is going to be massive and thus produce multiple big exits over the next 5–10 years. Jotting down these reflections, I can’t help but think about Peter Thiel’s observations on competitive markets (from Zero to One): Suppose you want to start a restaurant that serves British food in Palo Alto. “No one else is doing it,” you might reason. “We’ll own the entire market.” But that’s only true if the relevant market is the market for British food specifically. What if the actual market is the Palo Alto restaurant market in general? And what if all the restaurants in nearby towns are part of the relevant market as well. …If you lose sight of competitive reality and focus on trivial differentiating factors your business is unlikely to survive. Perhaps that’s why the European venture ecosystem is lagging the American. One can only guess. Investor Considerations We realise that many founders, particularly from underrepresented or disadvantaged backgrounds, do not necessarily have a choice between investors. But if you do, you might find inspiration in some of the things that helped us narrow in on the investors that not only understood our space, but also could help us become a multi-$billion company. Seeing that we are still early, we only have sparse evidence of whether they work or not. They are: If you are fortunate to get a term sheet, don’t only do the reference checks with people they introduce you to — do reference checks with founders who had less successful outcomes Don’t underestimate how an investor makes you feel — if they are dismissive, talk a lot, interrupt you, or are distracted, there is a good chance they’ll do that during board meetings too Investors are buying a piece of your company — it’s an irreversible marriage you can’t get a divorce from Don’t be afraid to ask investors about their investment thesis and what they think of your space — the best pitches feel like a conversation, and what they think your business should be might be fundamentally misaligned with what you have in mind A venture firm, particularly a newer one, can be less known, but the partner an ideal advisor — conversely, a firm might be a good fit, but the partner not Good luck and godspeed!
Nov 11 2022
7 M
What I Learned From My First Month at Y Combinator
For context, I am a cofounder of Encord, a company in the current winter 2021 batch. We are focused on building software to improve data labelling for computer vision. Before writing this I reviewed as many posts I could find(without having to go past the first page of Google) from people writing about their YC experience. There were fewer than I expected. Most people talk about interviewing and how to get in, which is probably the wider point of interest. I didn’t want this post to be redundant, so I decided to avoid talking about anything that collided with the bulk of the points from these posts, and there are many. Think of this as a second order approximation to lessons from YC. Word cloud from previous YC posts I reviewed First Order Lessons I’ll go ahead and summarise the top lessons from the other posts here, but you should read them. I’ve included the list at the bottom. They are well written, and the lessons are more generally applicable than in this post. They are (roughly): Y Combinator is not like school, there is no set track to follow You get out what you put in so work hard and embrace the grind Think big, aim high Maximise your use of the resources available to you, mostly your peer and mentor networks Get feedback early and often from your users Do things that don’t scale Advice is often contradictory, use your own judgment I want to dwell on the last point a little bit. My opinion is that advice should come with error bars. You have to weigh the advice you get by the distance of the advisor to your particular situation. If I asked Hafþór Júlíus Björnsson what the path to success looks like, he would probably tell me it’s in lifting weights. That would…not be the best way for me. My startup mentor At YC you get a lot of advice, but I find that the usefulness of advice is inversely proportional to the size of the audience that is receiving it. Advice from batch-wide events is not as relevant as what we get from our group partners in office hours. Think of the error bars as shrinking with how directed it is and also by how many times you have heard it from independent sources. With that said, take everything I write here with error bars, large ones. The lessons are both a mix of what I have heard explicitly from YC speakers and conclusions from my own experiences during the last month. They are both abstract and actionable. They also could just be wrong. The flip side is that most of this might be obvious. It just wasn’t to me. I could have been missing some basic stuff that it only took YC to elucidate. Sales is Search I don’t know why it took me so long to realise, sales is not (mostly) a persuasion problem, it is (mostly) a search problem. It is about finding the person that has the problem you are trying to solve. Ads in that sense are like autonomous search agents rather than units of persuadability. This is not SaaS sales As a startup you don’t have a much better search algorithm than just brute forcing it. So you need to send emails. A lot of emails. I had an idea for this before YC, but I was an order of magnitude off. Instead of sending dozens of emails a week, you have to send hundreds. The corollary to this is that you want to make it easy for people to find you. Inbound is much more efficient than outbound from a sales conversion perspective because the person that is reaching out has already passed a bunch of filters that an outbound contact has not. This means that you should launch your product early, and launch it often. You want to give people a chance to find you. Actionable: Get your product out there for people to easily find. Otherwise, up your email numbers by an order of magnitude. Look for No Time is precious, so a no is valuable. It’s why Nigerian scammers make their emails look fake. It’s because if you fall for it, you are likely gullible enough to fall for the entire scam. That saves them a lot of time wading through false positives. For a startup, a “no” both frees your time and provides a learning opportunity. It tells you what your product is missing and who your true customers are. A “yes”, on the other hand, is not a singular event. In SaaS sales “yes” is actually a funnel of yeses(plural yeese?) that you have to elicit to eventually end in a contract. If you are not making progress moving down this funnel, then you are probably wasting your time. A “no” saves you from having to figure that out for yourself. Thus, the order of preferred customer response is: yes > no > don’t respond > maybe. Yes is best, but no is for pros. Actionable: Try to get a commitment from customers one way or the other and don’t make your product something it is not. Accept the no and move on. You are Boring the Customer Clients don’t care about your product, they care about the solution to their problem. Show the clients how your product is that solution. That means listening to them. Before this month, I was talking a lot during customer calls and demos. I was trying to show as many features as I could from the product. It is natural to do that, because you personally find it cool how you solved all the difficult problems to get these features working. You think that if you show them a bunch of neat bells and whistles, they will be more impressed with the product. But the truth is that the customers just want to know how you can help them, and not much else. You don’t have to show them a feature that you think is awesome if it is not relevant to them. Otherwise you run the risk of boring them, which I now realise I was probably doing. Actionable: Talk 40%* less in demos. YC Relativity Not much to add on the personal side over what most posts have already said. The only interesting tidbit is our relationship with time. On one side you are so busy and working such long hours that it seems like every day just drags on endlessly. On the other hand, you constantly feel like you are behind and have to catch up with everything you have to do. You are racing against the week, but it is taking forever. The most apt analogy is the feeling you get on the last 100 meters of a 400 meter sprint. You are moving the fastest you can, but you feel very slow and weighed down. There is another related phenomenon that is akin to time dilation in special relativity, a disconnection of subjective time from calendar time. You are in a startup that is a rocket moving through “traction space”, where your subjective feeling of progress is benchmarked by changes there rather than the days of the week. The faster the ship goes, the more you become dislocated from the external contextual markers that feedback into your normal perception of time. Combining all that together and the last month has felt like a day, and a year. Failure > Success One thing I found from listening to all the talks is that I learn more from the stories about failure than the ones about success. Success stories are for inspiration, but failure is for education. The most effective talks are the melding of the two, where people talk either about how their failure turned into success or their success turned to failure. They give a nice A/B test over someone’s career. The problem I have with pure success stories goes back a bit to error bars on advice. They are rare cases, likely not statistically significant and difficult to reverse engineer. It’s like trying to figure out how to get rich from talking to a bunch of lottery winners; you will probably come away with a lot of errant conclusions. Not to say people that have built great startups are only just lucky, just that I am suspicious of extrapolating lessons from them because we are not seeing the rest of the sample. A much more informative frame is to talk to a bunch of lottery players. Only then do you learn that playing the lotto is a terrible idea(like starting a startup). The other reason is that I feel like it is easy for someone successful to mis-attribute the determinants of their own success. For one, they are probably still associated to the entity that made them successful so they might be attached to a narrative and might not have very clear incentives. There is also not that much reason for them to truly understand it, because things are working. In contrast, a smart, introspective founder that has undergone failure is under a lot more evolutionary pressure to learn from it and synthesise the experience into clear actionable lessons. Hearing about those experiences have personally been a lot more useful for guiding the trajectory of our company. When it comes to hearing a story about a startup that rocketed to unicorn land in a year versus the one that died in the garage, I’ll take the garage. It’s a Business, stupid People often see their startups as movements, or lifestyles, or disruptions to the status quo, or solving AI, or under some other grandiose perspective. But first and foremost, a startup is a business. That just means it should sell something that people want to buy. Like an ice cream stand. If YC has instilled one thing and only one thing in me so far, it’s the importance of looking at our company from this lens. It has been very useful to strip the pretensions around our startup and just focus on delivering a product that people want. That’s it, simple as that. This is underscored by the concept of product market fit. This has been mentioned in other posts, but I wanted to go over it again because of how strongly it has been emphasised during the last month. I didn’t truly understand the definition before YC, now I do. So, what is product market fit? The best description is from a paraphrase from Emmett Shear of Twitch(a YC company): Before product market fit it feels like you are rolling a boulder uphill, after you have product market fit it’s like you a chasing it downhill. Put differently from Dalton Caldwell(one of our group partners): You know you have product market fit when growth is no longer you biggest problem. Product market fit is the validation that you have created something that people actually want. It’s so important that multiple people have described a startup as existing in only two regimes, before PMF and after PMF. I have seen it myself in talking to founders on both sides of it. Founders before PMF sound down and unconfident. After, they are hectic and frenzied, trying to figure how to keep up with that boulder. In a sense, a startup is like a manic-depressive. Before PMF it is in the depressive stage, filled with existential angst over its place in the world, not sure if it should keep going on or not. After, it is manic, trying to productively focus its flailing energy and not harm itself in the process. Your first goal as a startup is to go from being depressive to being manic. Long Way to Go That’s the first month. It has been quite an experience thus far, even though it has been remote. I wanted to write this post while still early in the batch so that my mental narrative of the program hadn’t been consolidated in any way by hindsight and the eventual actual trajectory of Encord. Writing in the present is foggy, but history is hindsight, and thus history is hindsight bias. It’s like someone talking about a maze from a bird’s eye view versus from within the maze itself. We are in the maze now. Hopefully we will make it out. *Or some other significant number
Nov 11 2022
9 M
Why Running a Startup is Like Playing in Squid Game
This article comes from my experiences co-founding Encord. Spoilers ahead for the most-watched Netflix series of last year. Although if you are getting spoilers from this article, you probably don’t care about spoilers. It sounds odd, but there might not be a better approximation of what it’s like running a new technology company than the hit 2021 Netflix series Squid Game. The trials and tribulations faced by the contestants of the show are strangely analogous to those faced by the typical startup, albeit in a very different context. Both face exceedingly small chances of survival, with less than 1/100 VC-backed startups making it to IPO and only 1/456 contestants making it to the end of the season. Also, not unlike the show’s characters, startups often have great difficulty making money in the real world. We will go through the various challenges of Squid Game and show how these relate to practical problems facing every startup. Unlike in Squid Game, in the startup world, these challenges are not dealt with sequentially, but concurrently. This makes them that much more difficult to navigate. Red Light/Green Light (Fundraising) Going through the fundraising gauntlet can be brutal. One key reason is that investors are mostly looking for a reason to say no to your startup rather than yes. Seeing so many opportunities every day, investors have to come up with heuristics they can use to pass and quickly move on from deal to deal. The questions they ask thus act as a type of gating mechanism, where even a single answer can filter out a company for consideration. The net effect of this dynamic is that as a founder pitching, you hope you don’t say even one wrong thing to disqualify your company. It’s like playing Red Light/Green Light in Squid Game and making one tiny move after hearing “Red Light.” Didn’t give a good answer to the market size question Even after getting meetings and passing through pitches, investors will make holistic evaluations of companies before deciding to invest. Failure can happen at any point in the process(including after getting a term sheet, although this is rare). It may surprise you how many startups fail. You have not completed the challenge until the money has actually cleared into the company account. And even then, this is only the beginning of bringing your startup to success. Cookie Cutting(Product Development) There are only two fundamental requirements a product needs to fulfill for viability: it needs to do something valuable, and it needs to work. Often these requirements are in tension with each other. A natural temptation is to pepper a product with more and more features to increase its value and hopes that more people will use it. This, however, introduces development and maintenance burdens and bug potential at a faster rate than the increase in feature number. This point is demonstrated beautifully in the cookie-cutting challenge in Squid Game. The simpler the shape, the easier the challenge. The simpler the product, the easier it is to build and maintain. Simpler products break less. If you want to build an umbrella, you are opening yourself up to many more failure modes than if you build a circle. Thus, build circles, not umbrellas. When going through Y Combinator, I heard the importance of focus reiterated many times. It is critical to focus as a startup. I interpreted that this focus was for the sake of simplifying and optimizing a go-to-market motion where you can tailor your messaging and target a narrower set of companies to find initial product market fit. This is true, but I neglected the product component to the point. The less targeted and focused a product is, i.e., the more features and use cases it accommodates, the more complex the maintenance surface is and the more resources it requires to get right. Complexity makes it very easy to get lost in the morass of the product development cycle and can distract from solving other problems. For both Squid Game and startups, simplicity is key; simplicity gets you to the next round. Tug of War(Building a Team) The most important component of a company is the team, and there are two (not mutually exclusive!) strategies for making an effective team. You can hire the best and greatest, and/or you can put together a team where everyone is pulling in sync in the same direction. Again, this is demonstrated with metaphorical clarity in Squid Game with Tug of War. Our hero, Gi-Hun, finds himself in a suboptimal team playing in a game where the sum of raw strength seems the only relevant factor for success. They face up against what looks like a massively stronger team on the other side and seem doomed to failure. The one advantage, however, is that they come equipped with a strategy that they all commit to following. Dev team deploying to production. This metaphor works on two levels. Their initial strategy keeps them alive for the first burst of activity and gives them the confidence they can compete against the other ostensibly stronger team. Later, when they start waning, they also come up with an ingenious tactic to step forward and throw the other team off balance. This shows great teams don’t just follow one static plan, but also communicate and improvise in unison. As a startup navigating uncertain environments, the ability to adapt and adjust across the organization is crucial. To do this, communication channels must be open. If a team can’t communicate and coordinate, an accumulation of talent will not amount to much. Marbles(Making Difficult Choices) The most emotionally wrenching challenge from the show was the marble game. This section saw the characters make difficult sacrifices and dubious choices with respect to other characters. Although not as dramatic, this does reflect the sometimes difficult personal choices that you have to make running a startup. Social calendars get wiped out, working hours extended, and tough choices must be made. These types of trade-offs are not for everyone and really shouldn’t be. While glamourized in a current bonanza fundraising environment, startup founders often do not live healthy, well-rounded lives. I have heard no shortage of stories of founders maxing out their credit cards, moving back in with their parents, or firing their best friends. This is not to say that these choices or sacrifices are necessary(certainly, they are not preferable), but they are the simple reality for many. Squid Game captures this regrettable reality. Crossing the Bridge(Taking Risks) While effective dramatically, the least personally satisfying game of the show was bridge crossing. It was a game of pure luck, requiring no mental or physical skill. Unfortunately, this, too has an analog within the startup world. Sometimes, no matter what you do, how well you execute, and how strong your team is, you can just get unlucky. The startup world is inherently risky and comes loaded with a stack of implicit bets that you must make. Is this market going to grow? Is this solution the right one? Are you too early, or are you too late? You are not guarded against force majeure as you would be in a large robust, and established institution. There are just too many factors that are out of your control. You simply have to jump from platform to platform and hope. Squid Game(Competition) The most individually competitive game in the series is the final challenge, the eponymous squid game. With a startup, competition will, of course, be a factor, If you don’t have any competition, that is most likely a sign that you have chosen the wrong problem to solve. As Paul Graham has written: If you make anything good, you’re going to have competitors, so you may as well face that. You can only avoid competition by avoiding good ideas. All of the major tech giants that we know today started in massively competitive markets. It would have been easy to dismiss Google in the late 90s as just another search engine in an ocean of many. The iSmell, a device that could emit odors from your PC, had no competitors. But like with Squid Game, overcoming competition is only a small element of success. Based on elimination numbers, getting through Squid Game was far less about competing against others and much more about individual skill and luck. This aligns with the opinion that competition is greatly over-emphasized in the startup world as a failure mode. A startup is more likely to die from self-inflicted wounds than it is from the competition. The emphasis should always be on solving a real problem, not beating competitors. One final point is that nowhere in the Squid Game rules did it list the game as being winner-take-all. There may have been an alternative configuration where multiple contestants won the final prize and split it amongst themselves. This is how most markets actually operate. With a big enough pie, winnings can be shared. Conclusion While being a hugely entertaining show, Squid Game also offers many lessons for entrepreneurs embarking on the dangerous journey of starting a company. Perhaps with the next season, we can continue the learning and also find out more about the shadowy puppet master behind the entire operation. To get future updates on stories, subscribe here.
Nov 11 2022
7 M
Encord Named to the 2022 CB Insights AI 100 List 2022
Encord recognized for achievements in Computer Vision London, May 17, 2022 -- CB Insights today named Encord, the platform for data-centric computer vision, on its annual AI 100 ranking, a list showcasing the 100 most promising private artificial intelligence companies in the world. "This is the sixth year that CB Insights has recognized the most promising private artificial intelligence companies with the AI 100. This year's cohort spans 13 industries working on everything from recycling plastic waste to improving hearing aids," said Brian Lee, SVP of CB Insights’ Intelligence Unit. "Last year's AI 100 companies had a remarkable run, raising more than $6 billion, including 20 mega-rounds worth more than $100 million each. We’re excited to watch the companies on this year’s list continue to grow and create products and services that meaningfully impact the world around them.” “We’re honoured to not only be recognized as an innovative AI Startup but also as the only company within the computer vision bracket. The whole team has worked tirelessly from its humble beginnings to get to where we are today and for this effort to be recognized is great kudos to everyone,” said Ulrik Stig-Hansen, Co-Founder & CEO at Encord. “However, this is just the beginning and we have exciting developments on the horizon - watch this space.” Utilizing the CB Insights platform, the research team picked 100 private market vendors from a pool of over 7,000 companies, including applicants and nominees. CB Insights chose winners based on factors including R&D activity, proprietary Mosaic scores, market potential, business relationships, investor profile, news sentiment analysis, competitive landscape, team strength, and tech novelty. The research team also reviewed thousands of Analyst Briefings submitted by applicants. Backed by CRV, Y Combinator, WndrCo, Crange Venture Partners, Harpoon Ventures, and Harvard Management Corporation, Encord has partnered with the likes of world-leading healthcare institutions including King’s College London where its tool increased the efficiency of annotating pre-cancerous polyp videos by an average of 6.4x and automated 97% of labels, making the most expensive clinician 16x more efficient at labelling medical images. Encord has also worked with Memorial Sloan Kettering Cancer Center and Stanford Medical Centre where use of the platform reduced experiment duration by 80%. The CB Insights AI 100 Quick facts on the 2022 AI 100: Equity funding and deals: Since 2017, the 2022 AI 100 cohort has raised $12 billion+ from 650 investors, across 300+ equity deals. Unicorns: There are 16 companies with $1 billion+ valuations on the list. Geographic distributions: Seventy-three of the selected companies are headquartered in the US. Eight of the winners are based in the UK, followed by five in Canada. Other countries represented in this year’s list include India, Sweden, China, and Germany. About CB Insights CB Insights builds software that enables the world's best companies to discover, understand, and make technology decisions with confidence. By marrying data, expert insights, and work management tools, clients manage their end-to-end technology decision-making process on CB Insights. To learn more, please visit www.cbinsights.com. Contact: Media@cbinsights.com About Encord Encord is a leading platform for data-centric computer vision. Our solution empowers machine learning and data operations teams to annotate, manage, and evaluate training data. Encord teaches computers more effectively to do practical, real-world tasks in all future technologies. Encord removes complexity and enables data teams to work more efficiently.
Nov 11 2022
2 M
What I Learned Scaling a Start Up From 0 to 25 Employees
A lot changes when you transition from founding a startup to running a startup. When starting Encord, my co-founder Ulrik and I spent nearly a year coding from our couches before we were accepted into Y Combinator. YC is an amazing experience, and it helped us focus on growing and scaling the business. However, the work environment still wasn’t very glamorous. We changed from coding on our couches in London to coding from my parent’s dining room in California. Our team expanded by one when Rio, my parents’ Akita, decided to start spending his days with us. While Akitas are good for team morale, they don’t add that much value in terms of coding skill or strategic planning, so after three months at YC, we said goodbye to Rio and headed back to London where we began fundraising. Those early days of running a startup can be stressful because you have to wear a lot of different hats at once. There’s a lot of dynamic problem solving that has to be done. However, when it’s just you and your co-founder, it’s always clear as to who is working on what problem, and the communications channels are very efficient. So, while there’s a lot of work to be done, it gets done in an efficient way. Even after a couple of hires, the work environment remains quite similar, but, as the business begins to scale and you hire more people, it starts to change. Sourcing, interviewing, and hiring the right people for your team is challenging, and incorporating those new hires into the business also comes with its own unique set of growing pains. Below are four things I’ve learned as we’ve scaled Encord from 0 to 25 people. Hire Slowly and Deliberately When I worked in finance, I interviewed a lot of people, but it’s very different when you are interviewing and hiring for your own company, especially if it’s at an early stage. The success of the company is highly dependent on those first few hires. In a larger company with a hundred plus employees, a new hire won’t change the trajectory all that much. A hire only adds marginal incremental value whereas at a startup the early hires will have a tremendous impact on the course and future of the company. We thought carefully about both the skills we needed most and the type of people we wanted to hire. We hired slowly and deliberately. We began with the search for our founding software engineer. The decision to hire for this position first arose from our mutual agreement that we needed someone who was the best first complement to our skillset. Ulrik and I knew how to code, but we didn’t know how to design proper software systems. We needed someone who could take what we had built and properly architect it into a scalable system. At Y Combinator, we learned that the first few people hired will be the ones in charge of hiring the next 50 people, so you shouldn’t underestimate the leverage effect of those early hires. This advice resonated with us and has already had a profound impact on our staffing at Encord. So take your time. Don’t just hire indiscriminately. Make sure you get people who have the skills that you need and share your company’s values. Don’t Hire Jerks… Even if They’re Brilliant Netflix Co-Founder Reid Hastings once wrote that two factors largely shape the company culture: how the founders behave and how the team behaves on aggregate. When founding Encord, I realised that the culture of a startup does very much reflect the values of your first few hires and the way that these folks behave on a day-to-day basis. As a founder, your job is to behave in a way that you think is fitting for both yourself and your organisation. If you consistently keep yourself to this standard, the hope is that it will flow through to your employees and a great culture will arise via the combination of implicit behavioural cues and explicit values. However, as a founder, your job is also to hire a starting team that will likewise behave in a way that benefits the organisation. In other words, don’t hire jerks even if they're brilliant. We hired someone very early on who was incredibly talented, but they weren’t easy to work with and their attitude didn’t reflect the values we wanted expressed in our organisation. Ultimately, it didn’t work out. Because early hires have such an influence on the trajectory of the company, a bad hire early on can have an outsized impact on the company culture. Newly hired enthusiasts should love their job, and should be passionate enough to go the extra mile at work when it's needed. Learning that lesson early on was a good way for us to prevent potentially toxic cultures from taking over as we continue to scale. While it can be tough to get a full sense of a person's personality during the interviewing process, there are always warning signs. Ask yourself: Was this person flexible when we were scheduling the interview? Was their communication around hiring and interviewing effective and congenial? Did they brag during the interview? Was there a whiff of ego throughout the whole process? Remember, people tend to be on their best behaviour as candidates, so any attribute or characteristic you notice during hiring will likely be amplified once they start working for you. Don’t ignore warning signs, even if the person has a pristine academic background or performed brilliantly on your hiring task. If you notice off putting behaviour during the interview process, you will 100 percent notice it every day that you interact with them and so will the rest of your team. Suddenly, when faced with the day-to-day reality of working with that person, all those impressive degrees and experiences won’t seem so impressive. Don’t Compromise Expert negotiator Chris Voss has a simple story that explains what compromising – or “splitting the difference”– often looks like: A woman wants her husband to wear black shoes to match his suit. Her husband wants to wear his brown shoes because they are more comfortable. So they compromise, and he leaves the house wearing one brown shoe and one black shoe. Either outcome– a pair of brown shoes or a pair of black shoes– would have been better than the compromise. What this story illustrates is that we often think of compromising as the “morally right” thing to do, but in reality compromising often leads to unsatisfying outcomes that aren’t beneficial for either person involved in the negotiation or the business itself. It’s important to be collaborative with your co-founder (or your senior leadership if you don’t have a co-founder), but collaboration doesn’t necessarily mean doing an average of things. Everyone brings different expertise, knowledge, and opinions, and, in most cases, it’s wise to defer to the person with the stronger expertise and opinion. With hiring, for instance, everyone picks up on different signals and looks for different attributes. If I feel strongly that we should hire a certain candidate, and my co-founder feels strongly that we shouldn’t, then we should talk through the issues, and ultimately, defer to the decision of the person who has more expertise and stronger feelings. Decision making should be weighted dynamically, not equally. Ask yourself: who is the person most qualified to make this call and how strongly do they feel about it? Have a Growth Mindset Starting a business means constantly solving new problems, which is exciting. However, it also means that you’re always bad at your job because you’re always new to it. (And once you start accelling at your job, it changes, so you’re bad at it again). You’re constantly learning, and, if you think you know the answers, your business will likely remain stagnant. As a result, it’s important to have a growth mindset. At Encord, we’re learning, and we’re willing to make mistakes as long as we evaluate and update our beliefs and processes based on what we learned from those mistakes. We try to reflect and course correct in incremental ways to get better answers. When it comes to hiring and scaling, having a growth mindset has been especially important. Scaling creates a certain tension between trying to implement standardised processes to ensure the business operates smoothly and trying to stay nimble enough to be innovative and responsive. Shortly after our Series A, we hired a few people, and our organisational dynamic subtly began to change. There wasn’t a binary demarcation– the change didn’t occur at a certain amount of new hires– rather we felt it continuously, and then we realised that we needed to start formalising processes and procedures. It’s a challenge because we don’t want to be too process-driven like a large, inflexible corporation, but we also don’t want to be so ad hoc or ambiguous that our employees experience chaos trying to figure out what they’re doing and who they’re reporting to. At the moment, we’re figuring out how to design and build processes that can help the startup run itself and provide our people with guidance without hamstringing them. Striking that balance is difficult. There’s a lot of best practices out there, but you won’t know which ones apply to your company and its culture until you try it out and see if it fits. That trial and error process requires a growth mindset at every level of the organisation.
Nov 10 2022
6 M
Startup Taps Finance Micro-models for Data Annotation Automation
After meeting at an entrepreneur matchmaking event, Ulrik Hansen and Eric Landau teamed up to parlay their experience in financial trading systems into a platform for faster data labeling. In 2020, the pair of finance industry veterans founded Encord to adapt micromodels typical in finance to automated data annotation. Micromodels are neural networks that require less time to deploy because they’re trained on less data and used for specific tasks. Encord’s NVIDIA GPU-driven service promises to automate as much as 99 percent of businesses’ manual data labeling with its micromodels. “Instead of building one big model that does everything, we’re just combining a lot of smaller models together, and that’s very similar to how a lot of these trading systems work,” said Landau. The startup, based in London, recently landed $12.5 million in Series A funding. Encord is an NVIDIA Metropolis partner and a member of NVIDIA Inception, a program that offers go-to-market support, expertise and technology for AI, data science and HPC startups. NVIDIA Metropolis is an application framework that makes it easier for developers to combine video cameras and sensors with AI-enabled video analytics. The company said it has attracted business in gastrointestinal endoscopy, radiology, thermal imaging, smart cities, agriculture, autonomous transportation and retail applications. ‘Augmenting Doctors’ for SurgEase Back in 2021, the partners hunkered down near Laguna Beach, Calif., at the home of Landau’s parents, to build Encord while attending Y Combinator. And they had also just landed a first customer, SurgEase. London-based SurgEase offers telepresence technology for gastroenterology. The company’s hardware device and software enable remote physicians to monitor high-definition images and video captured in colonoscopies. “You could have a doctor in an emerging economy do the diagnostics or detection, as well as a doctor from one of the very best hospitals in the U.S.,” said Hansen. To improve diagnostics, SurgEase is also applying video data to training AI models for detection. Encord’s micromodels are being applied to annotate the video data that’s used for SurgEase’s models.The idea is to give doctors a second set of eyes on procedures. “Encord’s software has been instrumental in aiding us in solving some of the hardest problems in endoscopic disease assessment,” said SurgEase CEO Fareed Iqbal. With AI-aided diagnostics, clinicians using SurgEase might spot more things sooner so that people don’t need more severe procedures down the line, said Hansen. Doctors also don’t always agree, so it can help cut through the noise with another opinion, said Landau. “It’s really augmenting doctors,” said Landau. King’s College of London: 6x Faster King’s College of London had a challenge of finding a way to annotate images in precancerous polyp videos. So it turned to Encord for annotation automation because using highly skilled clinicians was costly on such large datasets. The result was that the micro models could be used to annotate about 6.4x faster than manual labeling. It was capable of handling about 97 percent of the datasets with automated annotation, the rest requiring manual labeling from clinicians. Encord enabled King’s College of London to cut model development time from one year to two months, moving AI into production faster. Triton: Quickly Into Inference Encord was initially setting out to build its own inference engine, running on its API server. But Hansen and Landau decided using NVIDIA Triton would save a lot of engineering time and get them quickly into production. Triton offers open-source software for taking AI into production by simplifying how models run in any framework and on any GPU or CPU for all inference types. Also, it allowed them to focus on their early customers by not having to build inference engine architecture themselves. People using Encord’s platform can train a micromodel and run inference very soon after that, enabled by Triton, Hansen said. “With Triton, we get the native support for all these machine learning libraries like PyTorch and it’s compatible with CUDA,” said Hansen. “It saved us a lot of time and hassles.”
Nov 10 2022
3 M
Announcing our $4.5 million seed round led by CRV
Today, we are thrilled to publicly launch Cord and announce our $4.5M seed round led by CRV and including the Y Combinator Continuity Fund, WndrCo, Crane Venture Partners, Harvard Management Company, and Intercom. When we first started Cord a little more than a year ago, we thought about how similar the state of AI was to the early days of computing and the internet. The potential of the technology was in full sight but the tools and processes surrounding it left much to be desired. Before starting the company, we experienced firsthand how the lack of tools to prepare quality training data was impeding the progress of building practical AI applications for everyone besides large tech companies. We wanted to de-FAANG AI by making access to quality training data available beyond Silicon Valley. With enterprises across every industry now evaluating AI as a core part of their business (over half deploying AI applications now compared to only 27% in 2019!), we think the time is right. The challenge was to avoid being another data labelling service. Other platforms have done an excellent job making sure your data gets to the right outsourced provider and back to you with newly minted bounding boxes. But what if you cannot send your data externally, perhaps for privacy or security reasons? What if you need someone with special knowledge — say, a doctor — to label the data? (A pedestrian is easily recognizable; a polyp, not so much.) What if you cannot afford to pay an expensive expert to label video frames for 100s of hours so that you have sufficient data to train your model? Or an even simpler problem: what if you get back your labelled data from an outsourced workforce and realize you also need to label ‘table’ as well as ‘chairs’ — do you send it back and pay another fee? The only effective way to solve all these problems at once was automation. Our combined experience in research and building thousands of models was the foundation for developing a solution. Cord is the only company that can automate labelling using our deeply technical approach, starting in computer vision. Our customers choose us because our novel algorithmic solution makes the annotation process orders of magnitude faster and cheaper, while allowing them to retain 100% control of their data. Stanford Medicine’s Division of Nephrology chose Cord because we increase efficiency without sacrificing accuracy. In one lab, they conducted experiments five times faster while processing three times the number of images. Another healthcare use case came out of our published research alongside King’s College London. We worked with collaborators Dr. Bu Hayee and Dr. Mehul Patel to annotate colonoscopy videos, which have relevant applications in cancer incidence. Since “colonoscopy” does not in itself inspire further reading, we’ll skip to the result: compared to open-sourcing labelling standard CVAT, Cord demonstrated an up to 16x increase in labelling efficiency, with our micro-models achieving > 97% accuracy for bounding box placement. An editorial in Endoscopy International Open cited our paper when discussing the potential of AI to reduce cancer-related mortality, reminding readers that “time is life!” By using Cord, doctors can focus their time on saving patients rather than drawing boxes. Our team at Cord has been fortunate to work not only with healthcare innovators, but also with teams building applications in agriculture, autonomous vehicles, and retail. We are working at the vanguard of the latest AI technologies to facilitate the most efficient training data creation possible, regardless of the industry. What is next? First and foremost, we are hiring. Take a look at our open roles here or email us at hello@cord.tech. We have an ambitious vision and we need to build a world class team to take advantage of this enormous opportunity. We’re humbled to already work with some of the best, including lead investor Anna Khan, General Partner, at CRV and our core group of experienced, supportive investors. And of course, if you are building AI applications, we would love to work together. Take a look at our article Label a Dataset with a Few Lines of Code to get a flavour for algorithmic labelling on real-world examples, and start using our platform today by booking a demo here. Eric & Ulrik
Nov 10 2022
3 M
Software To Help You Turn Your Data Into AI
Forget fragmented workflows, annotation tools, and Notebooks for building AI applications. Encord Data Engine accelerates every step of taking your model into production.