Apache Kafka - sgml/signature GitHub Wiki
Examples
- https://github.com/scholzj/kafka-test-apps/blob/main/kafka-producer.yaml
- https://www.stardog.com/labs/blog/stream-reasoning-with-stardog/
- https://towardsdatascience.com/kafka-python-explained-in-10-lines-of-code-800e3e07dad1
CLI
- https://www.confluent.io/blog/using-apache-kafka-command-line-tools-confluent-cloud/
- https://greenplum.docs.pivotal.io/streaming-server/1-3-6/kafka/load-from-kafka-example.html
Glossary
Flink
- https://cwiki.apache.org/confluence/display/Flink/FLIP-188%3A+Introduce+Built-in+Dynamic+Table+Storage#FLIP188:IntroduceBuiltinDynamicTableStorage-Retention
- https://www.alibabacloud.com/blog/introduction-to-unified-batch-and-stream-processing-of-apache-flink_601407
Integration
- https://www.confluent.io/blog/how-to-share-kafka-connectors-on-confluent-hub/
- https://docs.confluent.io/kafka-connectors/github/current/configuration_options.html
- https://docs.confluent.io/kafka-connectors/aws-lambda/current/lambda_sink_connector_config.html
- https://medium.com/geekculture/heroku-integration-capabilities-the-mini-guide-b8ce745faad1
- https://www.confluent.io/hub/castorm/kafka-connect-http
- https://docs.confluent.io/kafka-connect-aws-cloudwatch-logs/current/overview.html
- https://docs.confluent.io/kafka-connect-sftp/current/source-connector/csv_source_connector.html
- https://rmoff.net/2021/01/11/running-a-self-managed-kafka-connect-worker-for-confluent-cloud/
- https://developer.salesforce.com/blogs/2016/05/streaming-salesforce-events-heroku-kafka
- https://dzone.com/articles/kafka-for-xml-message-integration-and-processing
Use Cases
- https://mozilla-version-control-tools.readthedocs.io/en/latest/hgmo/replication.html
- http://www.liferaysavvy.com/2021/07/liferay-tomcat-access-logs-to-kafka.html
- https://www.oreilly.com/library/view/mastering-kafka-streams/9781492062486/ch01.html
- https://www.confluent.io/kafka-summit-sf18/kafka-as-an-eventing-system-to-replatform-a-monolith-into-microservices/
- https://towardsdatascience.com/getting-started-with-apache-kafka-in-python-604b3250aa05
- https://blog.bosch-si.com/developer/eclipse-hono-supporting-apache-kafka-for-messaging/
- https://github.com/eclipse/hono/issues/8
Concepts
- https://www.confluent.io/de-de/blog/enabling-exactly-once-kafka-streams/
- https://dev.to/heroku/what-is-a-commit-log-and-why-should-you-care-pib
- https://preparingforcodinginterview.wordpress.com/2019/10/04/kafka-3-why-is-kafka-so-fast/
- https://cwiki.apache.org/confluence/display/KAFKA/A+Guide+To+The+Kafka+Protocol
- https://www.oreilly.com/library/view/streaming-architecture/9781491953914/ch04.html
- https://docs.datastax.com/en/kafka/doc/kafka/kafkaHowMessages.html
Internals
- https://kafka.apache.org/cve-list
- https://jaceklaskowski.gitbooks.io/apache-kafka/content/kafka-tools-DumpLogSegments.html
- https://logging.apache.org/log4j/2.x/log4j-users-guide.pdf
- http://events17.linuxfoundation.org/sites/events/files/slides/developing.realtime.data_.pipelines.with_.apache.kafka_.pdf
Refactoring
- https://www.moengage.com/blog/kafka-at-moengage/
- https://www.confluent.io/es-es/blog/kafka-without-zookeeper-a-sneak-peek/
- https://www.confluent.io/blog/apache-flink-apache-kafka-streams-comparison-guideline-users/
Security
- https://stackoverflow.com/questions/60625612/how-does-one-use-kafka-with-openid-connect
- https://developer.ibm.com/tutorials/kafka-authn-authz/
Videos
VS
One of Kafka's core features is the partitioning of data by means of a partition key. Messages that share a key are written to the same partition, where their order is maintained, while messages in different partitions can be processed in parallel.
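A minimal sketch of key-based partitioning, written in plain Python without a Kafka client. The names `partition_for` and `route` are hypothetical, and `zlib.crc32` is only a stand-in hash (Kafka's Java client hashes keys with murmur2); the point is only that equal keys map to the same partition and keep their relative order there.

```python
import zlib


def partition_for(key: bytes, num_partitions: int) -> int:
    """Map a key to a partition index; equal keys always give the same index."""
    # Assumption: crc32 is used here purely for illustration, not Kafka's real hash.
    return zlib.crc32(key) % num_partitions


def route(messages, num_partitions):
    """Append each (key, value) pair to the partition selected by its key."""
    partitions = [[] for _ in range(num_partitions)]
    for key, value in messages:
        partitions[partition_for(key, num_partitions)].append((key, value))
    return partitions


# Hypothetical sample data: events for two orders, interleaved in send order.
messages = [
    (b"order-1", "created"),
    (b"order-2", "created"),
    (b"order-1", "paid"),
    (b"order-2", "cancelled"),
    (b"order-1", "shipped"),
]

partitions = route(messages, num_partitions=3)
for index, records in enumerate(partitions):
    print(index, records)
```

Within one partition the events of `order-1` stay in send order, while each partition could be handed to a separate consumer and processed in parallel.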
A Kafka cluster consists of brokers that coordinate the writing and reading of data to and from permanent storage. With Kafka, every message is stored. Communicating via permanent storage decouples the send and receive operations from each other.
The key benefits of Kafka are its scalability, its ordering guarantees, its wide-scale adoption, and its wealth of commercial service offerings.
All message types are brokered. This means that messages can be delivered even if the recipient was disconnected for a moment.
The communication is also decoupled in terms of time so that direct feedback from the recipient to the sender of a message is no longer possible.
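The decoupling through stored messages can be sketched with an in-memory, append-only log. This is a toy model, not Kafka's API: the classes `Log` and `Consumer` and their methods are hypothetical names chosen for illustration. The sender only appends, the recipient tracks its own read position, and a recipient that was disconnected for a moment simply resumes from where it stopped.

```python
class Log:
    """Append-only message store; reading a record does not remove it."""

    def __init__(self):
        self._records = []

    def append(self, value):
        """Store a message and return its offset (position in the log)."""
        self._records.append(value)
        return len(self._records) - 1

    def read_from(self, offset):
        """Return all messages stored at or after the given offset."""
        return self._records[offset:]


class Consumer:
    """Recipient that remembers how far it has read in the log."""

    def __init__(self, log):
        self._log = log
        self.offset = 0

    def poll(self):
        """Fetch every message not yet seen and advance the offset."""
        records = self._log.read_from(self.offset)
        self.offset += len(records)
        return records


log = Log()
consumer = Consumer(log)

log.append("a")
print(consumer.poll())  # the consumer is connected and reads the first message

# The consumer is "disconnected" here; the sender keeps writing regardless.
log.append("b")
log.append("c")

print(consumer.poll())  # after reconnecting, the missed messages are delivered
```

The sender never waits for the recipient and gets nothing back from it, which mirrors the point that communication is decoupled in time.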